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(54) TiUe: PROTEINS 

(57) Abstract: The present invention relates to a method of producing a variant glycolipid acyl transferase enzyme comprising: (a) 
selecting a parent enzyme which is a glycolipid acyl transferase enzyme characterised in that the enzyme comprises the amino acid 
sequence motif GDSX, wherein X is one or more of the following amino acid residues L, A, V, I, F, Y, H, Q, T, N, M or S; (b) 
modifying one or more amino acids to produce a variant glycolipid acyl transferase; (c) testing the variant glycolipid acyltransferase 
for transferase activity, an optionally hydrolytic activity, on a galactolipid substrate, and optionally a [>hospho]ii7id substrate and/or 
optionally a triglyceride substrate; (d) selecting a variant enzyme with an enhanced activity towards galactolipids compared with the 
parent enzyme; and optionally (e) preparing a quantity of the variant enzyme. The present invention further relates to variant lipid 
acyltransferase enzyme characterised in that lie enzyme comprises the amino acid sequence motif GDSX, wherein X is one or more 
of the foUowing amino acid residues L, A, V, I, F. Y, H, Q, T, N, M or S, and wherein the variant enzyme comprises one or more 
amino acid modifications compared with a parent sequence at any one or more of the amino acid residues defined in set 2, set 4, set 
6 or set 7. 
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PROTEINS 

REFERENCE TO RELATED APPLICATIONS 

Reference is made to the following related applications: United States Application 
5 Serial Number 09/750,990 filed on 20 July 1999; United States Application Serial 
Number 10/409,391; United States Application Serial Number 60/489,441 filed on 23 
July 2003; United Kingdom Application Number GB 0330016.7 filed on 24 December 
2003 and International Patent Application Number PCT/IB2004/000655 filed on 15 
January 2004. Each of these applications and each of the documents cited in each of 
10 these applications ("application cited documents"), and each document referenced or 
cited in the application cited documents, either in the text or during the prosecution of 
those applications, as well as all arguments in support of patentability advanced during 
such prosecution, are hereby incorporated herein by reference. Various documents are 
also cited in this text ("herein cited documents"). Each of the herein cited documents, 
15 and each document cited or referenced in the herein cited documents, is hereby 
incorporated herein by reference. 

FIELD OF INVENTION 

20 The present invention relates to methods of producing variant enzymes. The present 
invention further relates to novel variant enzymes and to the use of these novel variant 
enzymes. 

TECHNICAL BACKGROUND 

25 

Lipidicholesterol acyltransferase enzymes have been known for some time (see for 
example Buckley - Biochemistry 1983, 22, 5490-5493). In particular, 
glycerophosphoHpidxholesterol acyl transferases (GCATs) have been foxmd, which 
like the plant and/or mammalian lecithinxholesterol acyltransferases (LCATs), will 
30 catalyse fatty acid transfer between phosphatidylcholine and cholesterol. 
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Upton and Buckley (TIBS 20, May 1995, pl78-179) and Brumlik and Buckley (J. of 
Bacteriology Apr. 1996, p2060-2064) teach a lipase/acyltransferase from Aeromonas 
Irydrophila which has the ability to carry out acyl transfer to alcohol receptors in 
aqueous media. 

5 

A putative substrate binding domam and active site of the A. hydrophila 
acyltransferase have been identified (see for example Thornton et al 1988 Biochem. et 
Biophys. Acta 959, 153-159 and HUton & Buckley 1991 J. Biol. Chem. 266, 997- 
1000) for this enzyme. 

10 

Buckley et al (J. Bacteriol 1996, 178(7) 2060-4) taught that Serl6, Aspl 16 and His291 
are essential amino acids which must be retained for enzyme activity to be maintained. 

Robertson et al (J. Biol. Chem. 1994, 269, 2146-50) taught some specific mutations, 
15 namely Y226F, Y230F, Y30F, F13S, S18G, S18V, of the A. hydrophila 
acyltransferase, none of which are encompassed by the present invention. 

SUMMARY ASPECTS OF THE PRESENT INVENTION 

20 The present invention is predicated upon the finding of specific variants of a GDSx 
containing lipid acyltransferase enzyme, which variants have an increased transferase 
activity compared with a parent enzyme. In particxilar, the variants according to the 
present invention have an enhanced transferase activity using galactolipid as an acyl 
donor as compared with a parent en2yme. These lipid acyltransferases are referred to 

25 herein as glycolipid acyltransferases. The variants according to the present invention 
may additionally have an enhanced ratio of transferase activity using galactolipids as 
an acyl donor as compared with phosphohpid transferase activity (GL:PL ratio) and/or 
an enhanced ratio of transferse activity using galactolipids as an acyl donor as 
compared with galactolipid hydrolysis activity (GLt:GLh ratio) compared with a 

30 parent enzyme. 
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According to a first aspect the present invention provides a method of producing a 
variant glycolipid acyltransferase enzyme comprising: (a) selecting a parent enzyme 
which is a lipid acyltransferase enzyme characterised in that the enzyme comprises the 
amino acid sequence motif GDSX, wherein X is one or more of the following amino 
5 acid residues L, A, V, I, F, Y, H, Q, T, N, M or S; (b) modifying one or more amino 
acids to produce a variant lipid acyltransferase; (c) testing the variant lipid 
acyltransferase for activity on a galactolipid substrate, and optionally a phospholipid 
substrate and/or optionally a triglyceride substrate; (d) selecting a variant enzyme with 
an enhanced activity towards galactolipids compared with the parent enzyme; and 
1 0 optionally (e) preparing a quantity of the variant enzyme. 

In another aspect the present invention provides a variant glycoUpid acyltransferase 
enzyme characterised in that the enzyme comprises the amino acid sequence motif 
GDSX, wherein X is one or more of the following amino acid residues L, A, V, I, F, 
15 Y, H, Q, T, N, M or S, and wherein the variant enzyme comprises one or more amino 
acid modifications compared with a parent sequence at any one or more of the amino 
acid residues defined in set 2 or set 4 or set 6 or set 7 (defined hereinbelow). 

In a further aspect the present invention provides a variant glycolipid acyltransferase 
20 enzyme characterised in that the enzyme comprises the amino acid sequence motif 
GDSX, wherein X is one or more of the following amino acid residues L, A, V, I, F, 
Y, H, Q, T, N, M or S, and wherein the variant enzyme comprises one or more amino 
acid modifications compared with a parent sequence at any one or more of the amino 
acid residues detailed in set 2 or set 4 or set 6 or set 7 (defined hereinbelow) identified 
25 by said parent sequence being structurally aligned with the structural model of P10480 
defined herein, which is preferably obtained by structural aligranent of PI 0480 crystal 
structure coordinates with IIVNJDB and/or IDEO.PDB as taught herein. 

The present invention yet further provides a variant glycolipid acyltransferase enzyme 
30 characterised in that the enzyme comprises the amino acid sequence motif GDSX, 
wherein X is one or more of the folloAving amino acid residues L, A, V, I, F, Y, H, Q, 
T, N, M or S, and wherein the variant enzyme comprises one or more amino acid 
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modifications compared with a parent sequence at any one or more of the amino acid 
residues taught in set 2 identified when said parent sequence is aligned to the pfam 
consensus sequence (SEQ ID No. 1) and modified according to a structural model of 
PI 0480 to ensure best fit overlap (see Figure 55) as taught 

5 

According to a firrther aspect the present invention provides a variant glycolipid 
acyltransferase enzyme wherein the variant enzyme comprises an amino acid 
sequence, which amino acid sequence is shown as SEQ ID No. 2, SEQ ED No. 3, SEQ 
ID No. 4, SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 

10 16, SEQ ID No. 18, SEQ ID No. 20, SEQ ID No. 22, SEQ ID No. 24, SEQ ID No. 26, 
SEQ ID No. 28, SEQ ID No. 30, SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 36, 
SEQ ID No. 37, SEQ ID No. 39, SEQ ID No. 41, SEQ ID No. 43, or SEQ ID No. 45 
except for one or more amino acid modifications at any one or more of the amino acid 
residues defined in set 2 or set 4 or set 6 or set 7 (hereinafter defined) identified by 

1 5 sequence aUgnment with SEQ ID No. 2. 

In a fiirther aspect the present invention provides a variant glycolipid acyltransferase 
enzyme wherein the variant enzyme comprises an amino acid sequence, which amino 
acid sequence is shown as SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 

20 5, SEQ ID No. 6, SEQ ID No. 12, SEQ ID No. 14, SEQ ED No. 16, SEQ ID No. 18, 
SEQ ID No. 20, SEQ ID No. 22, SEQ ID No. 24, SEQ ID No. 26, SEQ ID No. 28, 
SEQ ID No. 30, SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 36, SEQ ID No. 37, 
SEQ ID No. 39, SEQ ID No. 41, SEQ ED No. 43 or SEQ ID No. 45 except for one or 
more amino acid modifications at any one or more of the amino acid residues defined 

25 in set 2 or set 4 or set 6 or set 7 identiGed by said parent sequence being structurally 
aligned with the structural model of PI 0480 defiaied herein, which is preferably 
obtained by structural alignment of P10480 crystal structure coordinates with 
irVN.PDB and/or 1DEO.PDB as taught herein. 

30 According to a fiirther aspect tiie present invention provides a variant glycolipid 
acyltransferase enzyme wherein the variant enzyme comprises an amino acid 
sequence, which amino acid sequence is shown as SEQ ID No. 2, SEQ ID No. 3, SEQ 
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. ID No. 4, SEQ ID No. 5, SEQ ID No, 6, SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 
16, SEQ ID No. 1 8, SEQ ID No. 20, SEQ ID No. 22, SEQ ID No. 24, SEQ ID No. 26, 
SEQ ID No. 28, SEQ ID No. 30, SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 36, 
SEQ ID No. 37, SEQ ID No. 39, SEQ ID No. 41, SEQ ID No. 43 or SEQ ID No. 45 
5 except for one or more amino acid modifications at any one or more of the amino acid 
residues taught in set 2 identified when said parent sequence is aligned to the pfam 
consensus sequence (SEQ ID No. 1) and modified according to a structural model of 
PI 0480 to ensure best fit overlap (see Figure 55) as taught 

10 The present invention yet fiirther provides the use of a variant glycolipolytic enzyme 
according to the present invention or obtained by a method according to the present 
invention in the manufacture of a substrate (preferably a foodstuff) to prepare a lyso- 
glycolipid, for example digalactosyl monoglyceride (DGMG) or monogalactosyl 
monoglyceride (MGMG) by treatment of a glycolipid (e.g. digalactosyl diglyceride 

15 (DGDG) or monogalactosyl diglyceride (MGDG)) with the variant lipolytic enzyme 
according to the present invention or obtained by a method according to the present 
invention to produce the partial hydrolysis product, i.e. the lyso-glycolipid. 

In a fiirther aspect, the present invention provides the use of a variant lipolytic enzyme 
20 according to the present invention or obtained by a method according to the present 
invention in the manufacture of a substrate (preferably a foodstuff) to prepare a lyso- 
phospholipid, for example lysolecithin, by treatment of a phospholipid (e.g. lecithin) 
with the variant lipolytic enzyme according to the present invention or obtained by a 
method according to the present invention to produce a partial hydrolysis product, i.e a 
25 lyso-phospholipid. 

In one aspect the present invention relates to a method of preparing a foodstuff the 
method comprising adding a variant lipolytic enzyme according to the present 
invention or obtained by a method according to the present invention to one or more 
30 ingredients of the foodstuff. 
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Another aspect of the present invention relates to a method of preparing a baked 
product from a dough, the method comprising adding a variant lipolytic enzyme 
according to the present invention or obtained by a method according to the present 
invention to the dough. 

5 

hi another aspect of the present invention there is provided the use of a variant 
lipolytic enzyme according to the present invention or obtained by a method according 
to the present mvention in the manufacture of an egg-based product for producing 
lysophospholipids. 

10 

In another aspect, there is provided a method of treating eggs or egg-based products 
comprising adding a variant lipolytic eirzyme according to the present invention to an 
egg or an egg-based product to produce a lysophospholipid. 

15 The variants of the invention may be used in a process of production of a snack food 
such as instant noodles in analogy with WO02/065854. 

The present invention relates to the use of the variant lipid acyltransferase in 
accordance with the present invention to results in a preferred technical effect or 
20 combination of technical effects in for example the foodstuff (such as those listed 
herein under Technical Effects'). 

A further aspect of the present invention provides a process of enzymatic degunaming 
of vegetable or edible oils, comprising treating the edible or vegetable oil with a 
25 variant lipolytic enzyme according to the present invention or obtained by a method 
according to the present invention so as to hydrolyse a major part of the polar lipids 
(e.g. phospholipid and/or glycoiipid). 

hi another aspect the present invention provides a process comprising treating a 
30 phospholipid so as to hydrolyse fatty acyl groups, which process comprising admixing 
said phospholipids with a variant lipolytic enzyme according to the present invention 
or obtained by a method according to the present invention. 
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In another aspect the present invention provides a process of reducing the content of a 
phospholipid in an edible oil, comprising treating the oil vnUi a variant lipolytic 
enzyme according to the present invention or obtained by a method according to the 
5 present invention so as to hydrolyse a major part of the phospholipid, and separating 
an aqueous phase containing the hydrolysed phospholipid from the oiL 

There is also provided a method of preparing a variant lipolytic enzyme according to 
the present invention or obtained by a method according to the present invention, the 
10 method comprising transforming a host cell with a recombinant nucleic acid 
comprising a nucleotide sequence coding for said variant lipolytic enzyme, the host 
cell being capable of expressing the nucleotide sequence coding for the polypeptide of 
the lipolytic enzyme, cultivating the transformed host cell under conditions where the 
nucleic acid is expressed and harvesting the variant lipolytic enzyme. 

15 

In a further aspect the present invention relates to the use of a variant lipolytic enzyme 
according to the present invention or obtained by a method according to the present 
invention in the bioconversion of polar lipids (preferably glycolipids) to make high 
value products, such as carbohydrate esters and/or protein esters and/or protein subunit 
20 esters and/or a hydroxy acid ester. 

A method of bioconverting polar lipids (preferably glycolipids) to high value products, 
which method comprises admixing said polar lipid with a variant lipolytic enzyme 
according to the present invention or obtained by a method according to the present 
25 invention. 

The present invention yet further relates to an immobilised variant Upolytic enzyme 
according to the present invention or obtained by a method according to the present 
invention. 

30 

Aspects of the present invention are presented in the claims and in the following 
commentary. 
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Other aspects concerning the nucleotide sequences which can be used in the present 
invention include: a construct comprising the sequences of the present invention; a 
vector comprising the sequences for use in the present invention; a plasmid comprising 
5 the sequences for use in the present invention; a transformed cell comprising the 
sequences for use in the present invention; a transformed tissue comprising the 
sequences for use in the, present invention; a transformed organ comprising the 
sequences for use in the present invention; a transformed host comprising the 
sequences for use in the present invention; a transformed organism comprising the 
10 sequences for use in the present invention. The present invention also encompasses 
methods of expressing the nucleotide sequence for use in the present invention using 
the same, such as expression in a host cell; including methods for transferring same. 
The present invention further encompasses methods of isolating the nucleotide 
sequence, such as isolating from a host cell. 

15 

Other aspects concerning the amino acid sequence for use in the present invention 
include: a construct encoding the amino acid sequences for use in the present iuvention; a 
vector encoding the amino acid sequences for use in the present invention; a plasmid 
encoding the amino acid sequences for use in the present invention; a transformed cell 

20 expressing the amino acid sequences for use in the present invention; a transformed tissue 
expressing the amino acid sequences for use in the present invention; a transformed organ 
expressing the amino acid sequences for use in the present invention; a transformed host 
expressing the amino acid sequences for use in the present invention; a transformed 
organism expressing flie amino acid sequences for use in the present invention. The 

25 present invention also encompasses methods of purifying the amino acid sequence for use 
in the present invention using the same, such as expression in a host cell; including 
methods of transferring same, and then purifying said sequence. 

For the ease of reference, these and fiirther aspects of the present invention are now 
30 discussed imder appropriate section headings. However, the teachings under each 
section are not necessarily limited to each particxilar section. 
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DEFINITION OF SETS 
Amino acid set 1: 

5 

Amino acid set 1 (note that these are amino acids in IIVN - Figure 57 and Figure 58.) 
Glv8. Asp9. SerlO. Leul 1, Serl2, Tyrl5, Gly44, Asp45, Thr46, Glu69, Leu70, GIv71. 
Gly72, Asn73. Asd74. GIy75, Leu76, Glnl06, DelO?, ArglOS, Leul09, ProllO, 
Tyrl 13, Phel21, Phel39, Phel40, Metl41, Tyrl45, Metl51, Asp154. Hisl57. GIyl55, 
10 nel56 . Prol58 

The highly conserved motifs, such as GDSx and catalytic residues, were deselected 
fix>m set 1 (residues underlined). For the avoidance of doubt, set 1 defines the amino 
acid residues within lOA of the central carbon atom of a glycerol in the active site of 
15 the IIVN model. 

Amino acid set 2: 

Amino acid set 2 (note that the numbering of the amino acids refers to the amino acids 
20 m the P 1 0480 mature sequence) 

Leul7, Lys22, Met23, Gly40, Asn80, Pro81, Lys82, Asn87, Asn88, Trplll, Vail 12, 
Alall4, Tyrll7, LeullS, Prol56, GIyl59, Ghil60, Asnl61, Prol62, Serl63, Alal64, 
Argl65, Serl66, Ghil67, Lysl68, Vall69, Vall70, Glul71, Alal72, Tyrl79, Hisl80, 
Asnl81, Met209, Leu210, Arg21 1, Asn215, Lys284, Met285, Gln289 and Val290. 

25 

Table of selected residues in Set 1 compared with Set 2: 


rVN model 
rVN A.hyd homologue 

P10480 

Mature sequence Residue 
Number 


PFAM 

Structure 

Gly8 

Gly32 
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Asp9 

Asp33 


1 

oerlU 

ISerJ4 



Leul 1 

Leu35 


Leul7 

berlz 

!Ser3o 


beri5 




T -.r«00 

Lys22 




Met23 

Tyrl5 

GIy58 


Gly40 

Gly44 

Asn98 


AsnoO 

Asp45 

Pro99 


Pro81 

Thr46 

LyslOO 


Lys82 




Asn87 




AsnSS 

Glu69 

Trpl29 


Trplll 

Leu70 

Vall30 


Vail 12 

Gly71 

Glyl31 



Gly72 

Alal32 


Alall4 

Asn73 

Asnl33 



Asp74 

Aspl34 



Gly75 

Tyrl35 


Tyrll7 

Leu76 

Leul 36 


Leul 18 

Glnl06 


Pro 174 

Prol56 

Ilel07 


Glyl77 

Glyl59 

ArglOS 


Glnl78 

Glnl60 

Leul 09 


Asnl79 

Asnl61 

ProllO 


180 to 190 

Prol62 

Tvrl 1^ 







Alal64 




Argl65 




Serl66 




Glnl67 




Lysl68 
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Vail 69 




Vail 70 




Glul71 




Alal72 

Phel21 

TT* -f f\n 

His 198 

Tyrl97 

Tyrl79 



His! 98 

Hisl80 



Asnl99 

AsnlSl 

Phel39 

Met227 


Met209 

Phel40 

Leu228 


Leu210 

Metl41 

Arg229 


Arg211 

Tyrl45 

Asn233 


Asn215 




Lys284 

MetlSl 

Met303 


Met285 

Asnl54 




Glyl55 

Gln307 


Glii289 

Ilel56 

Val308 


Val290 

Hisl57 

His309 



Pro 158 

Pro310 




Amino acid set 3: 

5 Amino acid set 3 is identical to set 2 but refers to the Aeromonas salmonicida (SEQ DD 
No. 28) coding sequence, i.e. the amino acid residue numbers are 18 higher in set 3 as 
this reflects the difference between the amino acid numbering in the mature protein 
(SEQ ID No. 2) compared with the protein including a signal sequence (SEQ ID No. 
28). 


10 
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The mature proteins of Aeromonas salmonicida GDSX (SEQ ID No. 28) and 
Aeromonas hydrophila GDSX (SEQ ID No. 26) differ in five amino acids. These are 
Thr3Ser, Ghil82Lys, Glu309Ala, SerSlOAsn, Gly318-, where the salmonicida residue 
is listed first and the hydrophila residue is listed last (FIGURE 59). The hydrophila 
5 protein is only 317 amino acids long , and lacks a residue in position 318. The 
Aeromonas salmonicidae GDSX has considerably high activity on polar lipids such as 
galactolipid substrates than the Aeromonas hydrophila protein. Site scaiming was 
performed on all five amino acid positions. 

1 0 Amino acid set 4: 

Amino acid set 4 is S3, Q182, E309, S3 10, and -318. 

Amino acid set 5: 

15 

F13S, D15N, S18G, SI 8V, Y30F, Dl 16N, Dl 16E, D157 N, Y226F, D228N Y230R 
Amino acid set 6: 

20 Amino acid set 6 is Ser3, Leul7, Lys22, Met23, Gly40, AsnSO, Pro81, Lys82, Asn 87, 
Asn88, Trplll, Valll2, Alall4,Tyrll7,Leull8, Prol56, Glyl59, Ghil60, Asnl61, 
Prol62, Serl63, Alal64, Argl65, Serl66, Glnl67, Lysl68, Vall69, Vall70, Glul71, 
Alal72, Tyrl79, Hisl80, AsnlSl, Ghil82, Met209, Leu210, Arg211, Asn215, 
Lys284, Met285, Ghi289, Val290, GIu309, Ser310, -318. 

25 

The numbering of the amino acids in set 6 refers to the amino acids residues in PI 0480 
(SEQ ID No. 2) - corresponding amino acids in other sequence backbones can be 
determined by homology alignment and/or structural alignment to P10480 and/or 
IIVN. 


30 
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Amino acid set 7: 

Amino acid set 7 is Ser3, Leul7, Lys22, Met23, Gly40, AsnSO, Pro81, Lys82, Asn 87, 
Asn88, Tiplll,VaI112,Alall4,Tyrll7,Leull8, Prol56, Glyl59, Glnl60, AsnI61, 
5 Prol62, Series, AIal64, Axgl65, Serl66, Glnl67, Lysl68, Vall69, Vall70, Glul71, 
Alal72, Tyrl79, Hisl80, AsnlSl, Glnl82, Met209, Leu210, Arg211, Asn215, 
Lys284, Mel285, Glii289, Val290, Glu309, Ser310, -318, Y30X (where X is selected 
from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), Y226X (where X is 
selected fiwm A. C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), Y230X (where 
10 X is selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), S18X 
(where X is selected from A, C, D, E, F, H, I, K, L, M, N, P, Q, R, T, W or Y), D157X 
(where X is selected from A, C, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y). 

The nirnibering of the amino acids in set 7 refers to the amino acids residues in P10480 
15 (SEQ ID No. 2) — corresponding amino acids in other sequence backbones can be 
determined by homology alignment and/or structural alignment to PI 0480 and/or 
IIVN). 

20 DETAILED ASPECTS OF THE PRESENT INVENTION 

Preferably, the parent lipid acyltransferase enzyme comprises any one of the following 
amino acid sequences: SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, 
SEQ ID No. 6, SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 1 6, SEQ ID No. 1 8, SEQ 

25 ID No. 20, SEQ ID No. 22, SEQ ID No. 24, SEQ ID No. 26, SEQ ID No. 28, SEQ ID 
No. 30, SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 36, SEQ ID No. 37, SEQ ID No. 
39, SEQ ID No. 41, SEQ ID No. 43 or SEQ ID No. 45 or an amino acid sequence 
which has 75% or more identity with any one of the sequences shown as SEQ ID No. 
2, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 12, SEQ 

30 ID No. 14, SEQ ID No. 16,. SEQ ID No. 1 8, SEQ ID No. 20, SEQ ID No. 22, SEQ ID 
No. 24, SEQ ID No. 26, SEQ ID No. 28, SEQ ID No. 30, SEQ ID No. 33, SEQ ID No. 
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34, SEQ ID No. 36, SEQ ID No. 37, SEQ ID No. 39, SEQ ID No. 41, SEQ ID No. 43 
orSEQIDNo.45. 

Suitably, the parent lipid acyltransferase enzyme according to tbe present invention 
5 comprises an amino acid sequence which has at least 80%, preferably at least 85%, 
more preferably at least 90%, more preferably at least 95%, more at least 98% 
homology with any one of the sequences shown as SEQ ID No. 2, SEQ ID No. 3, SEQ 
ID No. 4, SEQ ID No. 5, SEQ ID No. 6, SEQ ED No. 12, SEQ ID No. 14, SEQ ID No. 
16, SEQ ID No; 18, SEQ ID No. 20, SEQ ID No. 22, SEQ ID No. 24, SEQ ID No. 26, 
10 SEQ ID No. 28, SEQ ID No. 30, SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 36, 
SEQ ID No. 37, SEQ ID No. 39, SEQ ID No. 41, SEQ ID No. 43 or SEQ ID No. 45. 

Suitably, the parent lipid acyltransferase enzyme may be encoded by any one of the 
following nucleotide sequences: SEQ ID No. 7, SEQ ID No. 8, SEQ ID No. 9, SEQ ID 

15 No. 10, SEQ ID No. 11 , SEQ ID No. 13, SEQ ID No. 1 5, SEQ ID No. 17, SEQ ID No. 
19, SEQ ID No. 21, SEQ ID No. 23, SEQ ID No. 25, SEQ ID No.27, SEQ ID No. 29, 
SEQ ID No. 31, SEQ ID No. 32, SEQ ID No. 35, SEQ ID No. 38, SEQ ID No. 40, 
SEQ ID No. 42, SEQ ID No. 44 or SEQ ID No. 46 or a nucleotide sequence which has 
at least 75% or more identity with any one of the sequences shown as SEQ ID No. 7, 

20 SEQ ID No. 8, SEQ ID No. 9, SEQ ID No. 10, SEQ ID No. 11 , SEQ ID No. 13, SEQ 
ID No. 15, SEQ ID No. 17, SEQ BD No. 19, SEQ ID No. 21, SEQ ID No. 23, SEQ ED 
No. 25, SEQ BD No.27, SEQ ID No. 29, SEQ ID No. 31, SEQ ID No. 32, SEQ ID No. 

35, SEQ ID No. 38, SEQ ID No. 40, SEQ ID No. 42, SEQ ID No. 44 or SEQ ID No. 
46. 

25 

Suitably, the nucleotide sequence may have 80% or more, preferably 90% or more, 
more preferably 95% or more, even more preferably 98% or more identity with any 
one of the sequences shown as SEQ ID No. 7, SEQ ID No. 8, SEQ ID No. 9, SEQ ID 
No. 10, SEQ BD No. 11, SEQ BD No. 13, SEQ ID No. 15, SEQ BD No. 17, SEQ ID No. 
30 19, SEQ BD No. 21, SEQ ID No. 23, SEQ BD No. 25, SEQ BD No.27, SEQ ID No. 29, 
SEQ BD No. 31, SEQ BD No. 32, SEQ E) No. 35, SEQ ID No. 38, SEQ ID No. 40, 
SEQ BO No. 42, SEQ ID No. 44 or SEQ ID No. 46. 
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Preferably, the parent enzyme is modified at one or more of the amino acid residues 
defined in set 2 or set 4 or set 6 or set 7 when aligned to the reference sequence (SEQ 
ID No. 2) or structurally aligned to the structural model of P10480, or aligned to the 
5 pfam consensus sequence and modified according to the structural model of P10480. 

Suitably the variant enzyme may have an enhanced ratio of activity on galactolipids 
compared with the activity on either phospholipids and/or triglycerides when 
compared with the parent enzyme. 

10 

Suitably, the method according to the present invention may comprise testing the 
variant lipid acyltransferase for: 

(i) transferase activity firom a galactolipid substrate, and 

(ii) transferase activity from a phospholipids substrate; and 

15 selecting a variant enzyme, which when compared with the parent enzyme, has an 
enhanced ratio of transferase activity from galactolipids compared with phospholipids. 

Suitably, the ratio of transferase activity from galactolipids compared with 
phospholipids of the variant enzyme according to the present invention may be at least 
20 1, at least 2, at least 3, at least 4 or at least 5. 

Suitably, the method according to the present invention may comprise testing the 
variant lipid acyltransferase for 

(a) transferase activity from a galactolipid substrate, and 
25 (b) hydrolytic activity on a galactolipid substrate; and 

selecting a variant enzyme with an enhanced ratio of transferase activity Scorn 
galactolipids compared with its hydrolytic activity on glycolipids, compared with the 
parent enzyme. 

30 Suitably, the ratio of transferase activity on galactolipids compared to hydrolytic 
activity on galactolipids may be great than 1 , at least 1 .5, at least 2, at least 4 or at least 
5. 
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An assay for determirdng the transferase and hydrolytic activities from galactolipids 
and/or phospholipids is/are taught in Example 8 for example. 

The term "enhanced activity towards galactolipids" means the enzyme has an 
5 enhanced (i.e. higher) transferase activity when the lipid acyl donor is a galactohpid 
compared with the parent enzyme (galactolipid transferase activity) and/or has an 
increased ratio of galactolipid transferase activity when compared with phospholipids 
transferase activity compared with the parent enzyme (GLt:PLt ratio) and/or has an 
increased ratio of galactolipid transferase activity when compared with galactolipid 
1 0 hydrolysis activity compared with the parent enzyme (GLt:GLh ratio). 

Suitably, the variant enzyme compared with the parent enzyme may have an increased 
galactolipid transferase activity and either the same or less galactolipid hydrolytic 
activity. In other words, suitably the variant enzyme may have a higher galactolipid 
15 transferase activity compared with its galactolipid hydrolytic activity compared with 
the parent enzyme. Suitably, the variant enzyme may preferentially transfer an acyl 
group from a galactolipid to an acyl acceptor rather than simply hydrolysing the 
galactohpid. 

20 In one embodiment, the enzyme according to the present invention may have an 
increased transferase activity towards phosphoUpids (i.e an. increased phospholipid 
transferase activity) as compared with the parent enzyme. This increased phospholipid 
transferase activity may be independent of the enhanced activity towards galactohpids. 
Suitably, however, the variant enzyme may have an increased galactolipid transferase 

25 activity and an increased phosphoUpid transferase activity. 

In one embodiment the present invention provides a variant Upid acyltransferase 
enzyme characterised in that the enzyme comprises the amino acid sequence motif 
GDSX, wherein X is one or more of the following amino acid residues L, A, V, I, F, 
30 Y, H, Q, T, N, M or S, wherein the variant has an enhanced activity towards 
phosphoUpids, preferably enhanced phospholipid transferase activity, compared with 
the parent enzyme and wherein the variant enzyme comprises one or more amino acid 
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modifications compared with a parent sequence at any one or more of the amino acid 
residues defined in set 2 or set 4 or set 6 or set 7. 

The term "modifying" as used herein means adding, substituting and/or deleting. 
5 Preferably the term "modifying" means "substituting". 

For the avoidance of doubt, when an amino acid is substituted in the parent enzyme it 
is preferably substituted with an amino acid which is diflFerent firom that originally 
found at that position in the parent enzyme thus to produce a variant enzyme. In other 
10 words, the term "substitution" is not intended to cover the replacement of an amino 
acid with the same amino acid. 

Preferably, the parent enzyme is an enzyme which comprises the amino acid sequence 
shown as SEQ ID No. 2 and/or SEQ ID No. 28. 

15 

Preferably, the variant enzyme is an enzyme which comprises an amino acid sequence, 
which amino acid sequence is shown as SEQ ID No. 2 except for one or more amino 
acid modifications at any one or more of the amino acid residues defined in set 2 or set 
4 or set 6 or set 7. 

20 

In one embodiment, preferably the variant enzyme comprises one or more amino acid 
modifications compared with the parent sequence at at least one of the amino acid 
residues defined in set 4. 

25 Suitably, the variant enzyme comprises one or more of the following amino acid 
modifications compared with the parent enzyme: 
S3 E, A, G, K, M, Y, R, P, N, T or G 
E309Q, R or A, preferably Q or R 
-318Y, H, S or Y, preferably Y. 

30 

Preferably, X of the GDSX motif is L. Thus, preferably the parent enzyme comprises 
the amino acid motif GDSL. 
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Preferably the method of producing a variant lipid acyltransferase enzyme further 
comprises one or more of the following steps: 
1) structural homology mapping or 
5 2) sequence homology alignment. 

Suitably, the structural homology mapping may comprise one or more of the foUowmg 
steps: 

i) aligning a parent sequence with a structural model (irVN.PDB) shown in 
10 Figure 52; 

ii) selecting one or more amino acid residue within a lOA sphere centred on the 
central carbon atom of the glycerol molecule in the active site (see Figure 53) 
(such as one or more of the amino acid residues defined in set 1 or set 2); and 

iii) modifying one or more amino acids selected in accordance with step (ii) in said 
1 5 parent sequence. 

In one embodiment the amino acid residue selected may reside within a 9, preferably 
within a 8, 7, 6, 5, 4, or 3 A sphere centred on the central carbon atom of the glycerol 
molecule in the active site (see Figure 53). 

20 

Suitably, the structural homology mapping may comprise one or more of the following 
steps: 

i) aligning a parent sequence with a structural model (UVN.PDB) shown in 
Figure 52; 

25 ii) selecting one or more amino acids within a lOA sphere centred on the central 
carbon atom of the glycerol molecule in the active site (see Figure 53) (such as 
one or more of the amino acid residues defined in set 1 or set 2); 
iii) determining if one or more amino acid residues selected in accordance with 
step (ii) are highly conserved (particularly are active site residues and/or part of 

30 the GDSx motif and/or part of the GANDY motif); and 
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iv) modifying one or more amino acids selected in accordance with step (ii), 
excluding conserved regions identified in accordance with step (iii) in said 
parent sequence. 

5 In one embodiment the amino acid residue selected may reside within a 9, preferably 
within a 8, 7, 6, 5, 4, or 3 A sphere centred on the central carbon atom of the glycerol 
molecule in the active site (see Figure 53). 

Alternatively to, or in combination with, the structural homology mapping described 
10 above, the structureal homology mapping can be performed by selecting specific loop 
regions (LRs) or intervening regions (TVRs) derived fi-om the pfam aligmment 
(Alignment 2, Figure 56) overlayed with the PI 0480 model and IIVN. The loop 
regions (LRs) or intervening regions (IVRs) are defined in the Table below: 



PI 0480 amino acid positions (SEQ ED No 
2) 

rvRi 

1-19 

Loopl (LRl) 

20-41 

IVR2 

42-76 

Loop2 (LR2) 

77-89 

rVRS 

90-117 

Loop3 (LRS) 

118-127 

IVR4 

128-145 

Loop4 CLR4) 

146-176 

rVRS 

177-207 

Loops (LR5) 

208-287 

IVR6 

288-317 


15 

In some embodiments of the present invention the variant acyltransferase enzyme not 
only comprises an amino acid modifications at one or more of the amino acids defined 
in any one of sets 1 -4 and 6-7, but also comprises at least one amino acid modification 


w 
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in one or more of the above defined intervening regions (TVRl -6) (preferably in one or 
more of the IVRs 3, 5 and 6, more preferably in IVR 5 or IVR 6) and/or in one or more 
of the above-defined loop regions (LRl-5) (preferably in one or more of LRl, LR2 or 
LR5, more preferably in LR5). 

5 

In one embodiment, the variant acyltransferase according to the present invention or 
obtained by a method according to the present invention may comprise one or more 
amino acid modification which is not only defined by one or more of set 2, 4, 6 and 7, 
but also is within one or more of the IVRs 1-6 (preferably within IVR 3, 5 or 6, more 
10 preferably within in IVR 5 or IVR 6) or within one or more of the LRs 1-5 (preferably 
within LRl, LR2 or LRS, more preferably within LR5). 

Suitably, the variant acyltransferase according to the present invention or obtained by a 
method according to the present invention may comprise one or more amino acid 
15 modification which is not only in set 1 or 2, but also is within IVR 3. 

Suitably, the variant acyltransferase according to the present invention or obtained by a 
method according to the present invention may comprise one or more amino acid 
modification which is not only in set 1 or 2, but also is within IVR 5. 

20 

Suitably, the variant acyltransferase according to the present invention or obtained by a 
method according to the present invention may comprise one or more amino acid 
modification which is not only in set 1 or 2, but also is within IVR 6. 

25 Suitably, the variant acyltransferase according to the present invention or obtained by a 
method according to the present invention may comprise one or more amino acid 
modification which is not only in set 1 or 2, but also is within LR 1 . 

Suitably, the variant acyltransferase according to the present invention or obtained by a 
30 method according to the present invention may comprise one or more amino acid 
modification vAnch is not only in set 1 or 2, but also is within LR 2. 
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Likewise, in some embodiments of the present invention the variant acyltransferse 
enzyme not only comprises an amino acid modification at one or more amino acid 
residues which reside within a 10, preferably within a 9, 8, 7, 6, 5, 4, or 3, A sphere 
centred on the central carbon atom of the glycerol molecule in the active site (see 
Figure 53), but also comprises at least one amino acid modification in one or more of 
the above defined intervening regions (IVRl-6) (preferably in one or more of IVRs 3, 
5 and 6, more preferably in IVR 5 or IVR 6) and/or in one or more of the above- 
defined loop regions (LRl-5) (preferably in one or more of LRl, LR2 or LR5, more 
preferably in LR5). 

hi one embodiment, preferably the amiDO acid modification is at one or more amino 
acid residues which reside within a lOA sphere and also within LR5. 


Thus, the structural homology mapping may comprise one or more of the following 
15 steps: 

i) aligning a parent sequence with a structural model (UVN.PDB) shown in 
Figure 52; 

ii) selecting one or more amino acid residue within a lOA sphere centred on 
the central carbon atom of the glycerol molecule in the active site (see 

20 Figure 53) (such as one or more of the amino acid residues defimied in set 1 

or set 2); and/or selecting one or more amino acid residues within IVR 1-6) 
(preferably within IVR 3, 5 or 6, more preferably within in IVR 5 or IVR 
6); and/or selecting one or more amino acid residues within LRl -5 
(preferably within LRl, LR2 or LR5, more preferably within LR5); and 

25 iii) modifying one or more amino acids selected in accordance with step (ii) in 

said parent sequence. 

In one embodiment the amino acid residue selected may reside within a 9, preferably 
within a 8, 7, 6, 5, 4, or 3 A sphere centred on the central carbon atom of the glycerol 
30 molecule in the active site (see Figure 53). 
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Smtably, the structural homology mapping may comprise one or more of the following 
steps: 

i) aligning a parent sequence with a structural model (HVN.PDB) shown in 
Figure 52; 

5 ii) selecting one or more amino acids within a lOA sphere centred on the 
central carbon atom of the glycerol molecule in the active site (see Figure 
53) (such as one or more of the amino acid residues defined in set 1 or set 
2); and/or selecting one or more amino acid residues within IVRl-6) 
(preferably within IVR 3, 5 or 6, more preferably within in TVR 5 or IVR 
10 6); and/or selecting one or more amino acid residues within LRl-5 

(preferably within LRl, LR2 or LR5, more preferably within LR5); 
iii) determining if one or more amino acid residues selected in accordance with 
step (ii) are highly conserved (particularly are active site residues and/or 
part of the GDSx motif and/or part of the GANDY motif); and 
15 modifying one or more amino acids selected in accordance with step Qi), excluding 
conserved regions identified in accordance with step (iii) in said parent sequence. 


Suitably, the one or more amino acids selected in the methods detailed above are not 
only within a lOA sphere centred on the central carbon atom of the glycerol molecule 
20 in the active site (see Figure 53) (such as one or more of the amino acid residues 
defined in set 1 or set 2), but are also within one or more of the FVRs 1-6 (preferably 
within IVR 3, 5 or 6, more preferably within in IVR 5 or IVR 6) or within one or more 
of the LRs 1-5 (preferably within LRl, LR2 or LR5, more preferably within LR5). 


25 In one embodiment, preferably the one or more amino acid modifications is/are within 
LRS. When it is the case that the modification(s) is within LRS, the modification is 
not one which is defined in set 5. Suitably, the one or more amino acid modifications 
not only fall with the region defined by LRS, but also constitute an amino acid within 
one or more of set 2, set 4, set 6 or set 7. 

30 

Suitably, the sequence homology alignment may comprise one or more of the 
following steps: 
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i) selecting a first parent lipid acyltransferase; 

ii) identifying a second related lipid acyltransferase having a desirable activity; 

iii) aligning said first parent lipid acyltransferase and the second related lipid 
acyltransferase; 

5 iv) identifying amino acid residues that dififer between the two sequences; and 

v) modifying one or more of the amino acid residues identified in accordance with 
step (iv) in said parent lipid acyltransferase. 


Suitably, the sequence homology alignment may comprise one or more of the 
10 following steps: 

i) selecting a first parent lipid acyltransferase; 

ii) identifying a second related lipid acyltransferase having a desirable activity; 

iii) aligning said first parent lipid acyltransferase and the second related lipid 
acyltransferase; 

15 iv) identifying amino acid residues that differ between the two sequences; 

v) determiixing if one or more amino acid residues selected in accordance with 
step (iv) are highly conserved (particularly are active site residues and/or part 
of the GDSx motif and/or part of the GANDY motif); and 

vi) modifying one or more of the amino acid residues identified in accordance with 
20 step (iv) excluding conserved regions identified in accordance with step (v) in 

said parent sequence. 

Suitably, said first parent lipid acyltransferase may comprise any one of the following 
amino acid sequences: SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, 
25 SEQ ID No. 6, SEQ ID No. 12, SEQ ID No. 14, SEQ BD No. 16, SEQ ID No. 1 8, SEQ 
ID No. 20, SEQ ID No. 22, SEQ ID No. 24, SEQ ID No. 26, SEQ ID No. 28, SEQ ID 
No. 30, SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 36, SEQ ID No. 37, SEQ ID No. 
39, SEQ ID No. 41 , SEQ ID No. 43 or SEQ ID No. 45. 


30 Suitably, said second related lipid acyltransferase may comprise any one of the 
following amino acid sequences: SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 4, SEQ 
ID No. 5, SEQ ID No. 6, SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 16, SEQ ID 
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No. 18, SEQ DD No. 20, SEQ ED No. 22, SEQ ID No. 24, SEQ ID No. 26, SEQ ID No. 
28, SEQ ED No. 30, SEQ E) No. 33, SEQ ID No. 34, SEQ ID No. 36, SEQ ID No. 37, 
SEQ ID No. 39, SEQ ID No. 41, SEQ ID No. 43 or SEQ ID No. 45. 

5 The variant enzyme must comprise at least one amino acid modification compared 
with the parent enzyme. In some embodiments, the variant enzyme may comprise at 
least 2, preferably at least 3, preferably at least 4, preferably at least 5, preferably at 
least 6, preferably at least 7, preferably at least 8, preferably at least 9, preferably at 
least 10 amino acid modifications compared with the parent enzyme. 

10 

Suitably the methods according to the present invention may comprise a fiirther step of 
formulating the variant enzyme into an enzyme composition and/or a foodstuff 
composition, such as a bread improving composition. 

15 In order to align a GDSx polypeptide sequence (parent sequence) with SEQ ID No. 2 
(P01480), sequence alignment such as pairwise aligmnent can be used 
(http://www.ebi.ac.uk/e mboss/align/index.html\ Thereby, the equivalent amino acids 
in alternative parental GDSx polypeptides, which correspond to one or more of the 
amino acids defined in set 2 or set 4 or set 6 or set 7 in respect of SEQ ID No. 2 can be 

20 determined and modified. As the skilled person will readily appreciate, when using 
the emboss pairwise alignment, standard settings usually sufiBce. Corresponding 
residues can be identified using "needle" m order to make an alignment that covers the 
whole length of both sequences. However, it is also possible to find the best region of 
similarity between two sequences, using *Svater". 

25 

Alternatively, particularly in instances where parent GDSx polypeptides share low 
homology with SEQ ID No. 2, the corresponding amino acids in alternative parental 
GDSx polypeptides which correspond to one or more of the amino acids defined in set 
2, set 4, set 6 or set 7 in respect of SEQ ID No. 2 can be determined by structural 
30 alignment to the structural model of P10480, obtained by comparison of P10480 
derived structural model with the stmctural coordinates of IIVNJ^DB and IDEOJ^DB 
using the 'Deep View Swiss-PDB viewer' (obtained fiom www.expasv.org/sDdbv/) 
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(Figure 53 and Example 1). Equivalent residues are identified as those overlapping or 
in closest proximity to the residues in the obtained structural model of PO 10480, as 
illustrated in the Table comparing Set 1 and Set 2 (see section entitled "DejBnition of 
Sets" hereinabove), hi this v/ay other GDSX polypeptides can be compared against 
5 the UVN.PBD crystal co-ordinates, and equivalent residues to Set I determined. 

Alternatively, particularly in instances where a parent GDSx polypeptide shares a low 
homolo^ with SEQ ID No. 2, the equivalent amino acids in alternative parental GDSx 
polypeptides, which correspond to one or more of the amino acids defined in set 2 or 
set 4 or set 6 or set 7 in respect of SEQ ID No. 2 can be determined firom an alignment 
obtained from the PFAM database (PFAM consensus) modified based on the structural 
alignment as shown in Alignment 1 (Figure 55). The modification based on the 
structural models may be necessary to slightly shift the alignment in order to ensure a 
best fit overlap. Alignment 1 (Figure 55) provides guidance in this regard. 

The variant enzyme according to the present invention preferably does not comprise 
one or more of the amino acid modifications defined in set 5. 

Suitably the variant enzyme may be prepared using site directed mutagenesis. 
20 

Altematively, one can introduce mutations randomly for instance using a commercial 
kit such as the GeneMorph PGR mutagenesis kit firom Stratagene, or the Diversify 
PGR random mutagenesis kit firom Clontech. EP 0 583 265 refers to methods of 
optimising PGR based mutagenesis, which can also be combined with the use of 
25 mutagenic DNA analogues such as those described in EP 0 866 796. Error prone PGR 
technologies are suitable for the production of variants of lipid acyl transferases with 
preferred characteristics. WO0206457 refers to molecular evolution of lipases. 

A third method to obtain novel sequences is to fiagment non-identical nucleotide 
30 sequences, either by using any number of restriction enzymes or an enzyme such as 
Dnase I, and reassembling fiill nucleotide sequences coding for fimctional proteins 
(hereinafter referred to as "shuflQing"). Alternatively one can use one or multiple non- 
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identical nucleotide sequences and introduce mxrtations during the reassembly of the 
full nucleotide sequence. DNA shuffling and family shuffling technologies are suitable 
for the production of variants of lipid acyl transferases with preferred characteristics. 
Suitable methods for performing 'shufflmg' can be found in EPO 752 008, EPl 138 
5 763, EPl 103 606. Shuffling can also be combined with other forms of DNA 
mutagenesis as described in US 6,180,406 and WO 01/34835. 

Thus, it is possible to produce numerous site directed or random mutations into a 
nucleotide sequence, either in vivo or in vitro^ and to subsequently screen for improved 
1 0 functionality of the encoded variant polypeptide by various means. 

« 

As a non-limiting example. In addition, mutations or natural variants of a 
polynucleotide sequence can be recombined with either the v^ld type or other 
mutations or natural variants to produce new variants. Such new variants can also be 
1 5 screened for improved functionality of the encoded polypeptide. 

The following regions may preferably be selected for localised random mutagenesis 
and/or shuffling: IVR3, IVR 5, IVR 6, LRl , LR2, and/or LR5, most preferably LR5. 

20 For the production of libraries of variants microbial eukaryotic or prokaryotic 
expression hosts may be used. In order to ensure uniform expression within a library 
of variants, low copy number, preferably single event chromosomal expression 
systems may be preferred. Expression systems with high transformation frequencies 
are also preferred, particularly for the expression of large variant libraries (>1000 

25 colonies), such as those prepared using random mutagenesis and/or shufOing 
technologies. 

Suitable methods for the use of a eukaryotic expression host, namely yeast, in the 
production of enzymes are described inEP1131416. Microbial eukaryotic expression 
30 hosts, such as yeast, may be preferred for the expression of variant libraries produced 
using a eukaryotic acyltransferase parent gene. 
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Sxiitable methods using Bacillus^ i.e. Bacillus subtiliSy as an expression host in the 
production of enzymes are described in WO02/14490. Microbial prokaryotic 
expression hosts, such as Bacillus, may be preferred for the expression of variant 
libraries produced using a prokaryotic acyltransferase parent gene, for example the 
5 P10480 reference sequence (SEQ E) No 2). 

Suitably, the variant lipid acyltransferase according to the present invention retains at 
least 70%, preferably at least 80%, preferably at least 90%, preferably at least 95%, 
preferably at least 97%, preferably at least 99% homology with the parent enzyme. 

10 

Suitable parent enzymes may include any enzyme with esterase or lipase activity. 

Preferably, the parent enzyme aligns to the pfam00657 consensus sequence. 

15 In a preferable embodiment a variant lipid acyltransferase enzyme retains or 
incorporates at least one or more of the pfam00657 consensus sequence amino acid 
residues found in the GDSx, GANDY and HPT blocks. 

Enzymes, such as lipases with no or low lipid acyltransferase activity in an aqueous 
20 environment may be mutated using molecular evolution tools to introduce or enhance 
the transferase activity, thereby producing a variant hpid acyltransferase enzyme with 
significant transferase activity suitable for use in the compositions and methods of the 
present invention. 

25 Suitably, the lipid acyltransferase for xise in the invention may be a variant v^th 
enhanced enzyme activity on polar lipids, preferably glycolipids, when compared to 
the parent enzyme. Preferably, such variants also have low or no activity on lyso 
polar lipids. The enhanced activity on polar lipids, preferably glycolipids may be the 
result of hydrolysis and/or transferase activity or a combination of both. 

30 
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Variant lipid acyltransferases for use in the invention may have dea-eased activity on 
triglycerides, and/or monoglycerides and/or digjycerides compared with the parent 
en^me. 

5 Suitably the variant enzyme may have no activity on triglycerides and/or 
monoglycerides and/or diglycerides. Low activity on triglycerides is preferred in 
variant enzymes which are to be used for bakery applications, for treatment of egg or 
egg-based products and/or for degumming oils. 

10 In one embodiment, suitably the variant emyme may have a high activity on 
diglycerdies and no or low activity on triglycerides. 

When referring to specific amino acid residues herein the numbering is that obtained 
from alignment of the variant sequence with the reference sequence shown as SEQ ID 
15 No. 2. 

In one aspect preferably the variant enzyme comprises one or more of the following 
amino acid substitutions: 

20 S3A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W, or Y; and/or 
L17A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W, or Y; and/or 
SI 8A, C, D, E, F, H, I, K, L, M, N, P, Q, R, T, W, or Y; and/or 
K22A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 
M23A, C, D, E, F, G, H, I, K, L, N, P, Q, R, S, T, V, W, or Y; and/or 

25 Y30A, C, D, E, G, H, I, K, L, M, N, P. Q, R, S, T, V, or W; and/or 
G40A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 
N80A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W, or Y; and/or 
P81 A, C, D, E, F, G, H, I. K, L, M, N, Q, R, S, T, V, W, or Y; and/or 
K82A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 

30 N87A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W, or Y; and/or 
N88A, C, D, E, F, G, H, I, K. L, M, P, Q, R, S, T, V, W, or Y; and/or 
WlllA, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; and/or 
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VI 12A, C, D, E, F, G, H, I, K, L, M, N, P, a R. S, T, W, or Y; and/or 
Al 14C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 
Yl 17A, C, D, E, F, G, a I, K, L, M, N, P, Q, R, S, T, V, or W; and/or 
LI 1 8A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W, or Y; and/or 
5 PI 56A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W, or Y; and/or 
D157A, C, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W, or Y; and/or 
Gl 59A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 
Q160A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W, or Y; and/or 
Nl 61 A, C, D, E, F, G, H, I, K, L, M P, Q, R, S, T, V, W, or Y; and/or 

10 PI 62A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W, or Y; and/or 
S163A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W, or Y; and/or 
Al 64C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 
R165A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W, or Y; and/or 
S 1 66A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W, or Y; and/or 

15 Q167A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W, or Y; and/or 
K168A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 
V169A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W, or Y; and/or 
V170A, C, D, E, F, G, H, I, K, L, M, N, P, Q. R, S, T, W, or Y; and/or 
E171 A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 

20 Al 72C, D, E, F, G, H, I, K, L, M, N. P, Q, R, S, T, V, W, or Y; and/or 
Y179A, C, D, E, F. G, H. I, K, L, M, N, P, Q, R, S, T, V, or W; and/or 
HI 80A, C. D, E, F, G, I, K, L, M, P, Q, R, S, T, V, W, or Y; and/or 
N181 A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W, or Y; and/or 
Ql 82A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W, or Y, preferably K; and/or 

25 M209A, C, D, E, F, G, H, I, K, L, N, P, Q, R, S, T, V, W, or Y; and/or 
L210 A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W, or Y; and/or 
R2 1 1 A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 
N215 A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R. S, T, V, W, or Y; and/or 
Y226A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W; and/or 

30 Y230A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V or W; and/or 
BC284A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 
M285A, C, D, E, F, G, H, I, K, L, N, P, Q, R, S, T, V, W, or Y; and/or 
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Q289A,C,D,E,F,G,H,I,K,L,M,N,P,R,S,T,V,W,orY; and/or 
V290A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W, or Y; and/or 
E309A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, or Y; and/or 
S310A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W, or Y. 

In addition or alternatively thereto there may be one or more C-tenninal extensions. 
Preferably the additional C-terminal extension is comprised of one or more aliphatic 
amino acids, preferably a non-polar amino acid, more preferably of I, L, V or G. Thus, 
the present invention further provides for a variant enzyme comprising one or more of 
the following C-terminal extensions: 3181, 318L, 318V, 318a 

When it is the case that the residues in the parent backbone differ from those in 
PI 0480 (SEQ ID No. 2), as determined by homology alignment and/or structural 
alignment to PI 0480 and/or IIVN, it may be desirable to replace the residues which 
align to any one or more of the following amino acid residues in P10480 (SEQ ID No. 
2): Ser3, Leul7, Lys22, Met23, Gly40, Asn80, Pro81, Lys82, Asn 87, Asn88, Trpl 11, 
Valll2, AlalH, Tyrll7, LeullS, Prol56, Glyl59, Ghil60, Asnl61, Prol62, Serl63, 
Alal64, Argl65, Serl66, Glnl67, Lysl68, Vall69, Vall70, GIul71, Aial72, Tyrl79, 
HislSO, Asnl81, Ghil82, Met209, Leu210, Arg211, Asn215, Lys284, Met285, 
Gln289, Val290, Glu309 or Ser310, with the residue found in P10480 respectively. 

The following wildtype residues of PI 0480 have been found to be preferable for 
retaining good activity, particularly good transferase activity from a galactolipid:, LI 7, 
Will, R221, S3, G40, N88, K22, Y117, LI 18, N181, M209, M285, E309, M23. Thus 
preferably the variant enzyme comprises the amino acid residue foimd in PI 0480 at 
any one or more of these sites. 

Variant enzymes which have an increased hydrolytic activity against a polar lipid may 
also have an increased transferase activity from a polar lipid. 
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Variant enzymes which have an mcreased hydrolytic activity against a phospholipid, 
such as phosphatidylcholine (PC) may also have an increased transferase activity from 
a phospholipid. 

5 Variant enzymes which have an increased hydrolytic activity against a galactolipid, 
such as DGDG, may also have an increased transferase activity from a galactoUpid. 

Variants enzymes which have an increased transferase activity from a phospholipid, 
such as phosphatidylcholine (PC), may also have an increased hydrolytic activity 
10 against a phospholipid. 

Variants enzymes which have an increased transferase activity from a galactolipid, 
such as DGDG, may also have an increased hydrolytic activity against a galactolipid. 

15 Variants enzymes which have an increased transferase activity from a polar lipid may 
also have an increased hydrolytic activity against a polar lipid. 

Suitably, one or more of the following sites may be involved in substrate binding: 
Leul7; Alall4; Tyrl79; HislSO; Asnl81; Met209; Leu210; Arg211; Asn215; Lys284; 
20 Met285; Ghi289; Val290. 

The variant enzyme in accordance with the present invention may have one or more of 
the following fimctionalities compared with the parent enzyme: 

1) an increased relative transferase activity against galactolipid (DG) compared to PC 
25 calculated as % Tdg/Tpc (as illustrated in Example 8) 

2) an increased absolute transferase activity against galactolipid (DG) (as illustrated in 
Example 8) 

3) an increased transferase activity using galactolipid as donor (Tdg) relative to the 
hydrolytic activity Hdg on galactolipid (DG) (as illustrated in Example 8) 

30 4) an increased absolute tranisferase activity against PC (as illustrated in Example 8) 
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Wherein DG is galactolipid (e.g. DGDG) (and may be herein also referred to as GL) 
and PC is phospholipid (e.g. lecithin). Variants with an increased activity towards 
galactolipid include variants within categories 1), 2) and 3) above. Variants with an 
increased activity on galactohpids may also have an increased activity in 
5 phospholipids (as per category 4) above). 

1. A modijBcation to one or more of the following residues may result in a 
variant enzyme having an increased relative transferase activity against DG 
compared to PC calculated as % Tnc/Tpc: 
10 -318, N215, L210, S310, E309, H180, N80, VI 12, Y30X (where X is selected from 
A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, of W), V290, Q289, K22, G40, 
Y179, M209, L21 1, K22, P81, N87, Yl 17, N181, Y230X (where X is selected from A, 
C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), Q182. 

15 

Typically, one or more of the following substitutions may be preferred: 

S3A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y, preferably M, R, N, G, T, 

Q, P, Y, S, L, E, W, most preferably Q 

K:22A, E, C, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y, preferably A, C, E or R 
20 Y30A, C, D, H, K, M. N, P, Q, R, T, V, W, G, I, L, S, M, A, R or E, preferably H, T, 
W,N,D, C,QG,I,L,S,M,A,RorE 
G40L,N,T,VorA 

N80N, R, D, A, C, E, F, G, H, I, K, L, M, P, Q, S, T, V, W or Y, preferably H, I, Y, C, 
Q,M,S,W,L,N,R,DorF 
25 P81A, C, D, E, F, G H, I, K, L, M, N, Q, R, S, T, V, W or Y, preferably I, M, F, G, V, 
Y,D,CorA 

K82A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y, preferably H, K, S or R 
N87A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T. V, W or Y, preferably I, Y, M, T, Q, 
S,W,F,VorP 

30 N88A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y, preferably C, V, A or F 
V112C 
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Yl 17A, C, D, E. F. H, T, G, I, K, L, M, N, P, Q, R, S, V or W, preferably A, N, E, H, 
T,I,F,C,PorS 

LI 18A, C. D, E, F, G, H, I, K, M, N, P, Q, R. S, T, V, W, or Y, preferably F 
V112A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y, preferably I, M, F, Y, N, 
5 E,T,Q,HorP 

Y179A. C, D, E, F, G, H, I, K, L, M, N, P. Q, R, S, T, V or W, preferably F, C, H, I, L, 
M,P,VorW 

H180K, Q, A, C, D, E, F, G, I, L, M, P, R, S, T, V, W, or Y, preferably M, F, C, K or 
Q 

10 N181AorV 

Q182A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W, or Y, preferably K 
M209L, K, M, A, C, D. E, F, G, H, I, N, P, Q, R, S, T, V, W, or Y, preferably I, F, T, 
D,C,H,L,K,MorP 

L210 G, I, H, E, M, S, W, V, A, R, N, D, Q ,T, C, F, K, P or Y, preferably G, I, H, E, 
15 M,S,W,V,A,R,N,D,QJ,YorF 

R211G. Q, K, D, A, C, E, F, H, I, L, M, N, P, R, S, T, V, W or Y, preferably G, Q, K, 

D, H, I, M, F, P, S, Y, N, C, L or W 

N2I5A, C, D, E, F, G, H, 1, K, L, M, N, P, Q, R, S, T, V, W or Y, preferably I, F, P. T. 
W,HorA 

20 Y230A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V or W, preferably I, T, G, D, R, 

E, V, M or S, most preferably I, D, R or E 

Q289A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y, preferably F, W, H, I, 
Y, L, D, C, K, V, E, G. R, N or P, more preferably R, T, D, K, N or P 
V290A, C, D, E, H, F, G, I, K, L, M, N, P, Q, R, S, T, W or Y; 
25 E309S, Q, R, A, C, D, F, G, H, I, K, L, M, N, P, T, V, W or Y, preferably F, W, N, H, 
I,M,S, Q,R,AorY 

S310A, P, T, H, M, K, G, C, D, E, F, I, L, N, Q, R,V, W or Y, preferably F, Y, C, L, 
K,A,P,T,H,M,KorG 

-318 A, C, D, E, F, G, I, K, L, M, N, P, Q, R, T, V, W, Y, H or S 
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Preferably, one or more of the following modifications may result in a variant enzyme 
having an increased relative transferase activity against DG compared to PC calculated 
as % Tdg/Tpc 

5 S3 A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y, preferably M, R, N, G, T, 
Q, P, Y, S, L, E, W, most preferably Q 
G40L,N,T,VorA 

K82A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y, preferably H, K, S or R 
N88A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y, preferably C, V, A or F 
10 Y230A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V or W, preferably I, T, G, D, R, 
E, V, M or S, most preferably D, R or E 

Q289A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y, preferably F, W, H, I, 
Y. L, D, C, K, V, E, G or P, more preferably R, T, D, K or P 

15 ModiGcation of one or more of the following modifications results in a variant enzyme 
having an increased relative transferase activity against DG compared to PC calculated 
as % Tdq/Tpc: 
-318Y,HorS 
N215H 

20 L210G, I, H, E, M, S, W, V, A, R, N, D, Q or T 

S310A,P,T,H,M,KorG 

E309S, Q,AorR 

H180K,TorQ 

N80N,RorD 
25 V112C 

Y30G, I, L, S, M, A, R or E , more preferably Y30M, A or R 
V290R,E,HorA 
Q289R,T,DorN 
K22E 
30 G40L 

Y179VorR 
M209L,KorM 
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L211G,Q,KorD 
Y230V 
G40Q,LorV 
N88W 
5 N87RorD 

For some embodiments the following substitutions may also be suitable: 

K22AorC 

P81G 
10 N87 M 

Y117A,N,E,HorT 

NlSlAorV 

Y230I 

V290H 
15 N87R,D,EorM 

Q182T 

Preferably, the residues modified in order to increase the ratio of galactolipid 
transferase compared to phospholipid transferase activity are one or more of the 
20 foUowing:: -318, N215, L210, E309, H180, N80. 

Typically, one or more of the following substitutions are preferred: 

-31 8 Y, H or S, most preferably Y 
25 N215H 

L210D,QorT 
E309QorR 
HlSOKorQ 
N80N,RorD 


30 
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2. A modification to one or more of the following residues may result in a variant 
having an increased absolute transferase activity against DG: 
-318, N215, L210, S310, E309, H180, N80, V112, Y30X (where X is selected from A, 
C, D, E, G, H, I, K, L, M, N, P, Q. R, S, T, V, or W), V290, Q 289, K22, G40, Y179, 
5 M209, L21 1, K22, P81, N87, Yl 17, N181, Y230X (where X is selected from A, C, D, 

E, G, H, I, K, L. M. N, P, Q, R, S, T. V, or W), V290, N87, Q182, S3, S310, K82, 
A309. 

In particular, one or more of tiie following modifications may result in a variant having 
10 an increased absolute transferase activity against DG: 

-31 8 Y, H, S, A, C, D, E, F, G, I, K, L, M, N, P, Q, R, T, V or W, preferably Y, H, S or 
I 

N215A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; preferably H, 1, F, P, 
15 T, W or A, most preferably H, S, L, R, Y 

L210G, I, H, E, M, S, W, V, A, R, N, D, Q, T, C, F, K, P or Y. preferably D, Q, T, Y 
orF 

S310A, P, T, H, M, K, G, C, D, E, F, I, L, N, Q, R,V, W or Y. preferably F, Y, C, L, K 
orP, 

20 E309S, Q, R, A, C. D, F, G, H, I, K, L, M, N, P. S, T, V, W or Y; preferably S, Q, R, 

F, W, N, H, I, M or Y, most preferably S, Q, R, N, P or A 

HI 80A, C, D, E. F, G, I. K, Q L, M, P, R, S, T, V, W or Y; preferably K, Q, M, F or C, 

most preferably T, K or Q 

N181AorV 

25 N80N, R, D, A, C, E, F, G, H, I, K, L, M, P, Q, S, T, V, W or Y, preferably H, I, Y, C, 
Q, M, S, W, L, N, R, D or F, most preferably N, R, D, P, V, A or G 
VI 12A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y; preferably 1, M, F, Y, 
N,E,T,Q,HorP 

Y30G, I, L, S, A, E, C, D, H, K, M, N, P, Q, R, T, V or W, preferably H, T, W. N, D, 
30 CorQ 

V290A, C, D, E, F, G, H, I, K, L, M, N, P. Q, R, S, T, W or Y; 
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Q289A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y; preferably R, E, G, P, 
NorR 

K22A, C, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y, preferably C 
Y179A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V or W; preferably F, C, H, I, L, 
5 M,P or W, more preferably E,R,N,V,K,S 

M209A, C, D, E, F, G, H, I, L, K, M, N, P, Q, R, S, T, V, W or Y; preferably R, N, Y, 
EorV 

R21 1 A, C, E, F, G, H, I, L, M, N, P, Q, K, D, R, S, T, V, W or Y, preferably H, I, M, 
F, P, S, Y, N, C, L or W, most preferably R 
10 S310 C, D, E, F, I, L, N, Q, R,V, W or Y. preferably F, Y, C, L, K or P 

S3A, C, D, E, F, G, H, I, K, L, M, N. P, Q, R, T, V, W or Y, preferably M, R, N, A, G, 
T, Q, P, Y or S most preferably Q or N 

K82A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y, preferably H, K, S, E or 
R 

15 P81A, C, D, E, F, G3, 1, K, L, M, N, Q, R, S, T, V, W or Y; preferably I, M, F, V, Y, 
D.CorA 

N87A, C, F, G, H, I, K, L, M, P, Q, R, D, E, S, T, V, W or Y; preferably L, G or A 
Yl 17A, N, E, H, T, C, D, F, G, I, K, L, M, P, Q, R, S, V or W; preferably I, F, C, P or 
S 

20 N87A, C, F, G, H, I. K, L, P, Q, S, T, V, W or Y; preferably I, Y, T, Q, S, W, F, V or P 
Q182A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y, preferably D or K 
Y230A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V or W, preferably W, H, Q, L, P 
or C, most preferable T or G 

Dl 57A, C, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y, preferably C 
25 G40L 
Y226I 

Typically, one or more of the following substitutions are preferred: 
-318Y,HorS 
30 N2I5H 

L210G, I, H, E, M, S, W, V. A, R, N, D, Q or T 
S310A,P,T,H,M,KorG 
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E309S,QorR 
H180KorQ 
N80N,RorD 
V112C 

5 Y30G, I, L, S, M, A, R or E , more preferably Y30M, A or R 
V290R,E,HorA 
Q289RorN 
K22E 
G40L 
10 Y179V 

M209L,KorM 
L211G,Q,KorD 

For some embodiments the following substitutions may also be suitable: 
15 K22AorC 
P81G 
N87M 

Y117A,N,E,HorT 
N181AorV 
20 Y230 1 
V290H 

N87R, D.EorM 
Q182T 

25 Preferably, the residues modified in order to increase transferase activity from a 
galactolipid substrate (DGDG) are one or more of the following:: -318, N215, L210, 
E309,H180,N80. 

Typically, one or more of the following substitutions are preferred: 

30 

-3 1 8 Y, H or S, most preferably Y 
N215H 
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L210D,QorT 
E309QorR 
H180KorQ 
N80N,RorD 

5 

3. A modificiation at one or more of the following residues may result in a variant 
enzyme having an increased transferase activity Tdg relative to the hydroI)^c 
activity Hdg on DG: 

Y230, S3I0, HI 80, Q289, G40, N88, Y179, N215, L210, N80, Y30X (where X is 
10 selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), N87, M209, 
R21 1, S18X (where X is specificaUy selected from A, C, D, E, F, H, I, K, L, M, N, P, 
Q,R,T,WorY). 

Preferably, one or more of the following modifications may result in a variant enzyme 
15 having an increased transferase activity Tdg relative to the hydrolytic activity Hdg on 
DG: 

Y230A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T or W, preferably W, H, Q, L, P or 
C 

20 S310A, C, D, E, F, G, H, I, K, L, M, N, Q, R, T, V, W or Y, preferably F, Y, C, L, K 
orP 

Y179A, C, D, E, F, G, H, 1, K, L, M, N, P, Q, S, T, V, or W, preferably F, C, H, I, L, 
M,PorW 

H180A, C, D, E, F, G, I, K, L, M, P, Q, R, S, V, W or Y, preferably M, F or C 
25 Q289A, C, E, F, G, H, I, K, L, M, N, P, R, S, V, W or Y; preferably F, W, H, I, Y, L, 
D,C,K,V,E,GorP 

G40A, C, D, E, F, H, I, K, M, N, P, R, S, T, W or Y; preferably I, P, W or Y 
N88A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V or Y; preferably I or H 
N87A, C. E, F, G, H, I, K, L, M, P, Q, S, T, V, W or Y; preferably I. Y, T, Q, S, W, F, 
30 VorP 
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Typically, one or more of the following substitutions are preferred (such variant 
enzymes may have a decreased hydrolytic activity (galactolipid and/or phospholipids) 
and/or an increased tranferase activity firom galactolipid: 

5 Y179E,R,NorQ 
N215G 

L210D, H, R, E. A, Q, P, N. K, G, R, T, W, I, V or S 
N80G 
Y30L 
10 N87G 

Typically, one or more of the following substitutions are preferred (such variant 
enzymes may have a decreased hydrolytic activity (galactolipid and/or phospholipids) 
whilst retaining signiJBcant tranferase activity from galactolipid: 

15 

Y179E,R,N,Q 
N215G 

L210D, H, R, E, A, Q, P,N, K, G, R, T, W, I, V and S 
N80G 
20 Y30L 
N87G 
H180I,T 
M209Y 

R211 D.TandG 
25 S18G,MandT 
G40RandM 
N88 W 

N87C,D,R,EandG 

30 

4. Modification of one or more of the foUowing residues may result in a variant 
enzyme having an increased absolute transferase activity against phospholipid: 
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S3, D157, S310, E309, Y179,N215. K22, Q289,M23. H180, M209, L210, R2n, P81, 
V112,N80,L82,N88;N87 

5 Specific modifications which may provide a variant enzyme having an improved 
transferase activity fix»m a phospholipid may be selected fi-om one or more of the 
following: 

S3 A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; preferably N, E, K, R, A, 
P or M, most preferably S3A 
10 D157A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y ; preferably Dl 57S, R, 
E,N,G,T,V,Q,KorC 

S3 lOA, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; preferably S3 lOT 
-318 E 

E309A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; preferably E309 R, E, 
15 L,RorA 

Y179A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V or W; preferably Y179 D, T, 
E, R, N, V, K, Q or S, more preferably E, R, N, V, K or Q 

N215A, C, D, E, F, G, H, 1, K, L, M, P, Q, R, S, T, V, W or Y; preferably N215 S, L, 
RorY 

20 K22A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y; preferably K22 E, R, C 
or A 

Q289A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y; preferably Q289 R, E. 
G,PorN 

M23A, C, D, E, F, G, H, I, K, L N, P, Q, R, S, T, V, W or Y; preferably M23 K, Q, L, 
25 G.TorS 

HI 80A, C, D, E, F, G, I, K, L, M, P, Q, R, S, T, V, W or Y; preferably H180 Q, R or K 
M209 A, C, D, E, F, G. H, I, K, L, N, P, Q, R, S, T, V, W or Y; preferably M209 Q, S, 
R, A,N,Y,E,VorL 

L210A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; preferably L210 R, A, 
30 V,S,T.I,WorM 

R21 1 A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; preferably R21 IT 
P81A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V. W or Y; preferably P81G 
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VI 12A, C, D, E, F, G, H, I, K, L, M, N. P, Q, R, S, T, W or Y; preferably V112C 
N80A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y; preferably N80 R, G, N, 

D, P, T, E, V, A or G 

L82A, C, D, E, F, G, H, I, M, N, P, Q, R, S, T, V, W or Y; preferably L82N, S or E 
5 N88A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y; preferably N88C 

N87A, C, D, E. F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y; preferably N87M or G 

Modification of one or more of the following residues results in a variant enzyme 
having an increased absolute transferase activity against phospholipid: 

10 

S3 N, R, A, G 

M23 K, Q, L , G, T, S 

H180R 

L82G 

15 Y179E,R,N,V,KorQ 
E309R,S,LorA 

5. Residues the modification of which results in an increased transferase activity 
from a galactolipid substrate (DGDG) and an increase in ratio of galactolipid 
20 transferase compared to phospholipid transferase activity include one or more of: - 
318, N215, L210, S3 10, E309, H180, N80, VI 12, Y30X (where X is selected from A, 
C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), V290, Q 289, K22, G40, Y179, 
M209, L21 1, K22, P81, N87, Yl 17, N181, Y230X (where X is selected from A, C, D, 

E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), Q182. 

25 

Typically, one or more of the following substitutions are preferred: 

-318 Y,HorS 

N215H 

L210G, I, H, E, M, S, W, V, A, R, N, D, Q or T 
30 S310A,P,T,H,M,KorG 
E309S,A,QorR 
H180KorQ 
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N80N,RorD 
V112C 

Y30G, I, L, S, M, A, R or E , more preferably Y30M, A or R 
V290R,E,HorA 
5 Q289RorN 
K22E 
G40L 
Y179V 

M209L,KorM 
10 L211G,Q,KorD 

For some embodiments the following substitutions may also be suitable: 

K22AorC 

P81G 
15 N87M 

Y117A,N,E,HorT 

NlSlAorV 

Y230I 

V290H 
20 N87R,D,EorM 

Q182T 

Preferably, the residues modified in order to increase transferase activity fiom a 
galactolipid substrate (DGDG) and/or increase the ratio of galactolipid transferase 
25 compared to phospholipid transferase activity are one or more of the following:: -318, 
N215, L210, E309, HI 80, N80. 

Typically, one or more of the following substitutions arc preferred: 

30 -3 1 8 Y, H or S, most preferably Y 
N215H 
L21GD,QorT 
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E309QorR 
H180KorQ 
N80N,RorD 

5 6. The following wildtype residues of PI 0480 have been found to be preferable 
for retaining good activity, particularly good transferase activity from a 
galactolipid: 

Will, R21 1, N181, S3, L17, G40,N88, Y117, L118,N181, K22, M209, M285, M23 

10 

Preferably, these residues are retained in the variant enzyme. 

When making variant GDSx acyl-transferases for increased activity transferase from 
galactolipid substrates, where the parent en2yme has a residue corresponding to 
15 residues of the P10480 sequence at positions Will, R211, N181, S3, L17, G40, N88, 
Y117, LI 18, N181, K22, M209, M285, M23 other than the residue found m P10480 
the variant may preferably contain a substitution at the corresponding position to 
include the amino acid residue foimd in the PI 0480 sequence. 

20 LI 7 is preferably a hydrophobic amino acid residue 

7- The following combinations may have an increased transferase activity from a 
galactolipid substrate (DGDG) and/or an increase in ratio of galactolipid 
transferase activity compared to phospholipid transferase activity 

25 

N215H& -318Y 
N215H&L210D, Q, orT 
-318Y&L210,D, Q, orT 
N215H & -31 8Y & L210D, Q, or T 

30 

The above combinations may optionally also include a C-tenninal amino acid addition, 
such as -3 1 8Y, H or S, or preferably -3 1 8 Y. 
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The above combinations may optionally also include the following modificatian: 

Suitably one or more of tbe following combinations may have an increased transferase 
5 activity from a galactolipid substrate (DGDG) and/or an increase in ratio of 
galactolipid transferase activity compared to phospholipid transferase activity: 
E309A,QorR. 

N215H & -318Y, H, or S, preferably Y. 
L210D, Q or T & -3 1 8Y, H, or S, preferably Y. 
10 N215H&E309A,QorR 

L210D, Q or T & E309A, Q or R 
-318Y«&E309A,QorR 

The above combinations may optionally also include a substitution at position Q182, 
15 preferably Q182K. 

The following combinations may have an increased transferase activity from a 
galactolipid substrate (DGDG) and/or an increase in ratio of galactolipid transferase 
activity compared to phospholipid transferase activity, and/or an increase in ratio of 
20 galactolipid transferase compared to hydrolytic activity: 

N215H&N80G 

-318Y&N80G 

L210DorQ«&N80G 
25 N215H&N88N 

-318Y&N88N 

L210DorQ&N88N 

N215H & Y30L 

-318Y& Y30L 
30 L210D or Q & Y30L 

N215H&N87G 

-318Y&N87G 
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L210DorQ&N87G 
N215H & Y179E, R, N or Q 
-318Y&Y179E,R,NorQ 
L210D or Q & Y179 E, R, N or Q 

5 

As noted above, when referring to specific amino acid residues herein the numbering 
is that obtained from alignment of the variant sequence Avith the reference sequence 
shown as SEQ ID No. 2. 

1 0 For the avoidance of doubt, when a particular amino acid is taught at a specific site, for 
instance LI 1 8 for instance, this refers to the specific amino acid at residue number 118 
in SEQ ID No. 2. However, the amino acid residue at site 118 in a different parent 
enzyme may be different from leucine. 

15 Thus, when taught to substitute an amino acid at residue 1 1 8, although reference may 
be made to L118 it would be readily understood by the skilled person that when the 
parent enzyme is other than that shown in SEQ ID No. 2, the amino acid being 
substituted may not be leucine. It is, therefore, possible that when substituting an 
amino acid sequence in a parent enzyme which is not the enzyme having the amino 

20 acid sequence shown as SEQ ID No. 2, the new (substituting) amino acid may be the 
same as that taught in SEQ ID No. 2. This may be the case, for instance, where the 
amino acid at say residue 1 18 is not leucine and is, therefore different from the amino 
acid at residue 118 in SEQ ID No. 2. In other words, at residue 1 18 for example, if the 
parent enzyme has at that position an amino acid other than leucine, this amino acid 

25 may be substituted with leucine in accordance with the present invention. 

The term "lipid acyltransferase" as used herein means an enzyme which has 
acyltransferase activity (generally classified as E.C. 2.3.1.x in accordance with the 
Enzyme Nomenclature Reconmiendations (1992) of the Nomenclature Committee of 
30 the International Union of Biochemistry and Molecular Biology), whereby the enzjone 
is enable of transferring an acyl group from a lipid to one or more acceptor substrates. 
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such as one or more of the followmg: a sterol; a stanol; a carbohydrate; a protein; a 
protein subunit; glycerol. 

Preferably the lipid acyltransferase is cq)able of transferring an acyl group from a lipid 
5 to at least a sterol and/or a stanol, for example to cholesterol. 

Preferably, the lipid acyltransferase variant according to the present invention and/or 
for use in the methods and/or uses of the present invention is capable of transferring an 
acyl group from a lipid (as defined herein) to one or more of the following acyl 
10 acceptor substrates: a sterol, a stanol, a carbohydrate, a protein or subunits thereof, or a 
glycerol. 

For some aspects the "acyl acceptor" according to the present invention may be any 
compound comprising a hydroxy group (-OH), such as for example, polyvalent 
15 alcohols, including glycerol; sterol; stands; carbohydrates; hydroxy acids including 
fruit acids, citric acid, tartaric acid, lactic acid and ascorbic acid; proteins or a sub-xmit 
thereof, such as amino acids, protein hydrolysates and peptides (partly hydrolysed 
protein) for example; and mixtures and derivatives thereof. Preferably, the "acyl 
acceptor" according to the present invention is not water. 

20 

In one embodiment, the acyl acceptor is preferably not a monoglyceride and/or a 
diglyceride. 

In one aspect, preferably the variant enzyme is capable of transferring an acyl group 
25 from a Upid to a sterol and/or a stanol. 

In one aspect, preferably the variant en2yme is capable of transferring an acyl group 
from a Upid to a carbohydrate. 

30 In one aspect, preferably the variant enzyme is capable of transferring an acyl group 
from a lipid to a protein or a subunit thereof. Suitably the protein subunit may be one 
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or more of the following: an amino acid, a protein hydrolysate, a peptide, a dipeptide, 
an oligopeptide, a polypeptide. 

Suitably in the protein or protein subnnit the acyl acceptor may be one or more of the 
5 following constituents of the protein or protein subunit: a serine, a threonine, a 
tyrosine, or a cysteine. 

When the protein subunit is an amino acid, suitably the amino acid may be any 
suitable amino acid. Suitably the amino acid may be one or more of a serine, a 
10 threonine, a tyrosine, or a cysteine for example. 

In one aspect, preferably the variant enzyme is capable of transferring an acyl group 
from a lipid to glycerol. 

15 Li one aspect, preferably the variant enzyme is capable of transferring an acyl group 
from a lipid to a hydroxy acid. 

In one aspect, preferably the variant enzyme is capable of transferring an acyl group 
from a Upid to a polyvalent alcohol. 

20 

In one aspect, the variant lipid acyltransferase may, as well as being able to transfer an 
acyl group from a lipid to a sterol and/or a stanol, additionally be able to transfer the 
acyl group from a lipid to one or more of the following: a carbohydrate, a protein, a 
protein subimit, glycerol. 

25 

Preferably, the lipid substrate upon which the variant lipid acyltransferase according to 
the present invention acts is one or more of the following lipids: a phospholipid, such 
as a lecithin, e.g. phosphatidylcholine, a triacylglyceride, a cardiolipin, a diglyceride, 
or a glycoUpid, such as digalactosyldiglyceride (DGDG) or monogalactosyldiglyceride 
30 (MGDG) for example. More preferably, the variant enzyme according to the present 
invention acts on one or both of DGDG and MGDG. Preferably, the variant enzyme 
according to the present invention has no (or has only limited) activity on 
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digalactosylmonoglyceride (DGMG) and monogalactosylmonoglyceride (MGMG). 
Thus preferably the lipid substrate is not one or both of DGMG or MGMG. This lipid 
substrate may be referred to herein as the "lipid acyl donor". The term lecithin as used 
herein encompasses phosphatidylcholine, phosphatidylethanolamine, 
phosphatidylinositol, phosphatidylserine and phosphatidylglyceroL 

The term "galactolipid" as used herein means one or more of DGDG or DGMG. 

The term "phospholipid" as used hereia means lecithin, including phosphatidylcholine. 

^ The term "polar lipid" as used herein means a phospholipids and/or a galactolipid, 
preferably a phospholipids and a galactolipid. 

For some aspects, preferably the lipid substrate upon which the variant lipid 
acyltransferase acts is a phospholipid, such as lecithin, for example 
phosphatidylcholine. 

For some aspects, preferably the lipid substrate is a glycolipid, such as DGDG or 
MGDG for example. 

Preferably the lipid substrate is a food lipid, that is to say a lipid component of a 
foodstuff. 

For some aspects, preferably the variant lipid acyltransferase according to the present 
invention is incapable, or substantially incapable, of acting on a triglyceride and/or a 1- 
monoglyceride and/or 2-monoglyceride. 

Suitably, the lipid substrate or lipid acyl donor may be one or more lipids present in 
one or more of the following substrates: fats, including lard, taUow and butter fat; oils 
including oils extracted from or derived from palm oil, sunflower oil, soya bean oil, 
safflower oil, cotton seed oil, groiind nut oil, com oil, olive oil, peanut oil, coconut oil, 
and rape seed oil. Lecithin from soya^ rape seed or egg yolk is also a suitable lipid 
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substrate. The lipid substrate may be an oat lipid or other plant based material 
containing galactolipids. 

In one aspect the lipid acyl donor is preferably lecithin (such as phosphatidylcholine) 
5 in egg yolk. 

For some aspects of the present invention, the lipid may be selected Jfrom lipids having 
a fatty acid chain length of from 8 to 22 carbons. 

10 For some aspects of the present invention, the lipid may be selected from hpids having 
a fatty acid chain length of from 16 to 22 carbons, more preferably of from 16 to 20 
carbons. 

For some aspects of the present invention, the lipid may be selected from Upids having 
15 a fatty acid chain length of no greater than 14 carbons, suitably from lipids having a 
fatty acid chain length of from 4 to 14 carbons, suitably 4 to 10 carbons, suitably 4 to 8 
carbons. 

Suitably, the variant lipid acyltransferase according to the present invention may 
20 exhibit one or more of the following lipase activities: glycolipase activity (E.C, 
3.1.1.26), triacylglycerol lipase activity (E.G. 3,1.1.3), phospholipase A2 activity (E.G. 
3.1.1.4) or phospholipase Al activity (E.C. 3.1.1.32). The term "glycolipase activity" 
as used herein encompasses "galactolipase activity". 

25 Suitably, the variant lipid acyltransferase according to the present invention may have 
at least one or more of the following activities: glycoUpase activity (E.G. 3.1.1.26) 
and/or phospholipase Al activity (E.G. 3.1.1.32) and/or phospholipase A2 activity 
(E.G. 3.1.1.4). 

30 For some aspects, the variant lipid acyltransferase according to the present invention 
may have at least glycolipase activity (E.G. 3.1.1.26). 
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Suitably, for some aspects the variant lipid acyltransferase according to the present 
invention may be capable of transferring an acyl group from a glycolipid and/or a 
phospholipid to one or more of the following acceptor substrates: a sterol, a stanol, a 
carbohydrate, a protein, glycerol. 

5 

For some aspects, preferably the variant lipid acyltransferase according to the present 
invention is capable of transferring an acyl group from a glycolipid and/or a 
phospholipid to a sterol and/or a stanol to form at least a sterol ester and/or a stanol 
ester. 

10 

For some aspects, preferably the variant lipid acyltransferase according to the present 
invention is capable of transferring an acyl group from a glycolipid and/or a 
phospholipid to a carbohydrate to form at least a carbohydrate ester. 

15 For some aspects, preferably the variant lipid acyltransferase according to the present 
invention is capable of transferring an acyl group from a glycolipid and/or a 
phospholipid to a protein to form at least protein ester (or a protein fatty acid 
condensate). 

20 For some aspects, preferably the variant lipid acyltransferase according to the present 
invention is capable of transferring an acyl group from a glycolipid and/or a 
phospholipid to glycerol to form at least a diglyceride and/or a monoglyceride. 

For some aspects, preferably the variant lipid acyltransferase according to the present 
25 invention does not exhibit triacylglycerol lipase activity (E.G. 3 . 1 . 1 3). 

In some aspects, the variant lipid acyltransferase naay be capable of transferring an 
acyl group from a lipid to a sterol and/or a stanol. Thus, in one embodiment the "acyl 
acceptor*' according to the present invention may be either a sterol or a stanol or a 
30 combination of both a sterol and a stanol. 
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In one embodiment suitably the sterol and/or stanol may comprise one or more of the 
following structural features: 

i) a 3-beta hydroxy group or a 3-alpha hydroxy group; and/or 

ii) A:B rings in the cis position or A:B rings in the trans position or C5-C6 is 
unsaturated. 

Suitable sterol acyl acceptors include cholesterol and phytosterols, for example alpha- 
sitosterol, beta-sitosterol, stigmasterol, ergosterol, campesterol, 5,6-dihydrosterol, 
brassicasterol, alpha-spinasterol, beta-spinasterol, gamma-spinasterol, deltaspinasterol, 
fucosterol, dimosterol, ascosterol, serebisterol, episterol, anasterol, hyposterol, 
chondrillasterol, desmosterol, chalinosterol, poriferasterol, clionasterol, sterol 
glycosides, and other natural or synthetic isomeric forms and derivatives. 

In one aspect of the present invention suitably more than one sterol and/or stanol may 
act as the acyl acceptor, suitably more than two sterols and/or stands may act as the 
acyl acceptor. In other words, in one aspect of the present invention, suitably more 
than one sterol ester and/or stanol ester may be produced. Suitably, when cholesterol 
is the acyl acceptor one or more further sterols or one or more stands may also act as 
the acyl acceptor. Thus, in one aspect, the present invention provides a method for the 
in situ production of both a cholesterol ester and at least one sterol or stanol ester m 
combination. In other words, the lipid acyltransferase for some aspects of the present 
invention may transfer an acyl group from a lipid to both cholesterol and at least one 
further sterol and/or at least one stanol. 

In one aspect, preferably the sterol acyl acceptor is one or more of the following: 
alpha-sitosterol, beta-sitosterol, stigmasterol, ergosterol and campesterol. 

In one aspect, preferably the sterol acyl acceptor is cholesterol. When it is the case 
that cholesterol is the acyl acceptor for the variant lipid acyltransferase, the amount of 
free cholesterol in the foodstuff is reduced as compared with the foodstuff prior to 
exposure to the variant lipid acyltransferase and/or as compared with an equivalent 
foodstuff which has not been treated with the variant lipid acyltransferase. 
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Suitable stanol acyl acceptors include phytostanols, for example beta-sitostanol or ss- 
sitostanol. 

5 In one aspect, preferably the sterol and/or stanol acyl acceptor is a sterol and/or a 
stanol other than cholesteroL 

In some aspects, the foodstuff prepared in accordance with the present invention may 
be used to reduce blood serum cholesterol and/or to reduce low density lipoprotein. 
10 Blood serum cholesterol and low density lipoproteins have both been associated with 
certain diseases in humans, such as atherosclerosis and/or heart disease for example. 
Thus, it is envisaged that the foodstuffs prepared in accordance with the present 
invention may be used to reduce the risk of such diseases. 

15 Thus, in one aspect the present invention provides the use of a foodstuff according to 
the present invention for use in the treatment and/or prevention of atherosclerosis 
and/or heart disease. Thus is one aspect the foodstuff may be considered as a 
neutraceutical. 

20 In a further aspect, the present invention provides a medicament comprising a 
foodstuff according to the present invention. 

In a fiirther aspect, the present invention provides a method of treating and/or 
preventing a disease in a himian or animal patient which method comprises 
25 administering to the patient an effective amount of a foodstuff according to the present 
invention. 

Suitably, the sterol and/or the stanol "acyl acceptor" may be found naturally within the 
foodstuff. Alternatively, the sterol and/or the stanol may be added to the foodstuff. 
30 When it is the case that a sterol and/or a stanol is added to the foodstuff, the sterol 
and/or stanol may be added before, simultaneously with, and/or after the addition of 
the lipid acyltransferase according to the present invention. Suitably, the present 
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invention may encompass the addition of exogenous sterols/stanols, particularly 
phytosterols/phytostanols, to the foodstuff prior to or simultaneously wifli the addition 
of the variant enzyme according to the present invention. 

5 For some aspects, one or more sterols present in the foodstuff may be converted to one 
or more stanols prior to or at the same time as the variant lipid acyltransferase is added 
according to the present invention. Any suitable method for converting sterols to 
stanols may be employed. For example, the conversion may be carried out by chemical 
hydrogenation for example. The conversion may be conducted prior to the addition of 
10 the variant lipid acyltransferase in accordance with the present invention or 
simultaneously with the addition of the variant Upid acyltransferase in accordance with 
the present invention. Suitably enzymes for the conversion of sterol to stanols are 
taught in WOOO/061771. 

1 5 Suitably the present invention may be employed to produce ph3^ostanol esters in situ 
in a foodstuff. Phytostanol esters have increased solubility through lipid membranes, 
bioavailability and enhanced health benefits (see for example WO92/99640). 

In some embodiments of the present invention the stanol ester and/or the sterol ester 
20 may be a flavouring and/or a texturiser. In which instances, the present invention 
encompasses the in situ production of flavourings and/or texturisers. 

In one embodiment, the present invention provides a metfiod of producing a plant 
sterol ester and/or stanol ester and lysolecithin in an edible oil (such as a plant oil, such 

25 as soya bean oil for instance) without the formation of free fatty acids by treatment of 
the oil with a variant enzyme according to the present invention. In such instances the 
lysolecithin so produced may be removed using a deguinming process. Any 
degununing process may be used, such as one or more of the known degumming 
processes. Any jfree fatty acids can be removed by deodorizing if necessary. Notably, 

30 any stanol/sterol ester produced in the oil is not removed by the deodorizing process. 
Thus, the edible oil produced comprises sterol esters and/or stanol esters which may 
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have beneficial nutritional and/or nutriceutical effects, such as lowering blood 
cholesterol levels. 

Suitable oils in which this method could be carried out are those comprising inter alia 
5 lecithin and a sterol/stanoL Suitably, the oil is a crude oil when treated. Suitably, the 
edible oil may be one or more of the following: com germ oil, cotton seed oil, linseed 
oil, palm oil, peanut oil, rapeseed oil, sesame oil, soybean oil, sunflower oil and wheat 
germ oil. 

1 0 For some aspects of the present invention, the variant lipid acyltransferase according to 
the present invention may utilise a carbohydrate as the acyl acceptor. The 
carbohydrate acyl acceptor may be one or more of the following: a monosaccharide, a 
disaccharide, an oligosaccharide or a polysaccharide. Preferably, the carbohydrate is 
one or more of the following: glucose, finctose, anhydrofiructose, maltose, lactose, 

15 sucrose, galactose, xylose, xylooligosacharides, arabinose, maltooUgosaccharides, 
tagatose, microthecin, ascopyrone P, ascopyrone T, cortalcerone. 

Suitably, the carbohydrate "acyl acceptor" may be found naturally within the 
foodstuff. Altematively, the carbohydrate may be added to the foodstuflF. When it is 
20 the case that the carbohydrate is added to the foodstuff, the carbohydrate may be added 
before, simultaneously with, and/or after the addition of the variant lipid 
acyltransferase according to the present invention. 

Carbohydrate esters can fiinction as valuable emulsifiers in foodstuffs. Thus, \^^en it 
25 is the case that the enzyme functions to transfer the acyl group to a sugar, the invention 
encompasses the production of a second in situ emulsifier in the foodstuff. 

In some embodiments, the variant lipid acyltransferase may utilise both a sterol and/or 
stanol and a carbohydrate as an acyl acceptor. 

30 

The utihsation of a variant hpid acyltransferase which can transfer the acyl group to a 
carbohydrate as well as to a sterol and/or a stanol is particularly advantageous for 
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foodstuffe comprising eggs. In particular, the presence of sugars, in particular glucose, 
in eggs and egg products is often seen as disadvantageous. Egg yolk may comprise up 
to 1% glucose. Typically, egg or egg based products may be treated with glacose 
oxidase to remove some or all of this glucose. However, in accordance with, the 
5 present invention this \mwanted sugar can be readily removed by "esterifying*' the 
sugar to form a sugar ester. 

For some aspects of the present invention, the variant lipid acyltransferase according to 
the present invention may utilise a protein as the acyl acceptor. Suitably, the protein 
10 may be one or more of the proteins foimd in a food product, for example in a dairy 
product and/or a meat product By way of example only, suitable proteins may be 
those found in curd or whey, such as lactoglobulin. Other suitable protems include 
ovalbumin from egg, gliadin, glutenin, puroindoline, lipid transfer proteins from 
grains, and myosin from meat 

15 

Preferably, the parent lipid acyltransferase enzyme according to the present invention 
may be characterised using the following criteria: 

(i) the enzyme possesses acyl transferase activity which may be defined as 
ester transfer activity whereby the acyl part of an original ester bond of 

20 a lipid acyl donor is transferred to an acyl acceptor to form a new ester; 

and 

(ii) the enzyme comprises the amino acid sequence motif GDSX, wherein 
X is one or more of the following amino acid residues L, A, V, I, F, Y, 
H, Q,T,N,Mor S. 

25 

Preferably, X of the GDSX motif is L. Thus, prefembly the enzyme according to the 
present invention comprises the amino acid sequence motif GDSL. 

The GDSX motif is comprised of four conserved amino acids. Preferably, the serine 
30 within the motif is a catalytic serine of the lipid acyltransferase enzyme. Suitably, the 
serine of the GDSX motif may be in a position corresponding to Ser-16 in Aeromonas 
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hydrophila lipolytic enzyme taught in Bmmlik & Buckley (Journal of Bacteriology 
Apr. 1996, Vol. 178, No. 7, p 2060-2064). 

To determine if a protein has the GDSX motif according to the present invention, the 
5 sequence is preferably compared with the hidden markov model profiles (HMM 
profiles) of the pfam database. 

Pfam is a database of protein domain families. Pfam contains curated multiple 
sequence alignments for each family as well as profile hidden Markov models (profile 
10 HMMs) for identifying these domains in new sequences. An introduction to Pfam can 
be found in Bateman A et al (2002) Nucleic Acids Res. 30; 276-280. Hidden Markov 
models are used in a number of databases that aim at classifying proteins, for review 
see Bateman A and Hafl DH (2002) Brief Bioinform 3; 236-245. 

15 http://wvm.ncbi.nlm.nih.gov/eDtrez/querv.fcgi?cmd===Retrieve&db=PubMedfclist uids 
=1 2230032&dopt=Abstract 

http7/wwwjicbi.nlm.nih.gov/entrez/querv.fcgi?cmd=Retrieve&db=PubMed&list uids 
=1 1 7523 14&dopt=Abstract 

20 For a detailed explanation of hidden Markov models and how they are applied in the 
Pfam database see Durbin R, Eddy S, and Krogh A (1998) Biological sequence 
analysis; probabilistic models of proteins and nucleic acids. Cambridge University 
Press, ISBN 0-521-62041-4. The Hammer software package can be obtained from 
Washington University, St Louis, USA. 

25 

Alternatively, the GDSX motif can be identified using the Hammer software package, 
the instructions are provided in Durbin R, Eddy S, and Krogh A (1998) Biological 
sequence analysis; probabilistic models of proteins and nucleic acids. Cambridge 
University Press, ISBN 0-521-62041-4 and the references therem, and the HMMER2 
30 profile provided within this specification. 
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The PFAM database can be accessed, for example, through several servers which are 
cxirrently located at the following websites. 
http://www.saDger.ac.uk/Software/Pfam/iDdex.shtml 
5 http://pfam.wustl.edu/ 
http://pfam.iouv.iDra.fr/ 
http://pfam.cgb.ki.se/ 

The database offers a search facility where odc can enter a protein sequence. Using the 
10 default parameters of the database the protein sequence will then be analysed for the 
presence of Pfam domains. The GDSX domain is an established domain in the 
database and as such its presence in any query sequence will be recognised. The 
database will return the alignment of the Pfam00657 consensus sequence to the query 
sequence. 

15 

A multiple alignment, including Aeromonas salmonicida or Aeromonas hydrophila 
can be obtained by: 
a) manual 

obtain an alignment of the protein of interest with the Pfam00657 consensus 
20 sequence and obtain an alignment of PI 0480 with the Pfam00657 consensus 

sequence following the procedxire described above; 

or 

25 b) through the database 

After identification of the Pfam00657 consensus sequence the database offers 
the option to show an alignment of the query sequence to the seed alignment of 
the Pfam00657 consensus sequence. PI 0480 is part of this seed ahgnment and 
is indicated by GCAT_AERHY. Both the query sequence and PI 0480 will be . 

30 displayed in the same window. 

The Aeromonas hydrophila reference sequence: 
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The residues of Aeromonas hydrophila GDSX lipase are numbered in the NCBI file 
PI 0480, the numbers in this text refer to the numbers given in that file which in the 
present invention is used to detennine specific amino acids residues which, in a 
preferred embodiment are present in the lipid acyltransferase enzymes of the 
5 invention. 

The Pfam alignment was performed (Figure 33 and Figure 34): 

The following conserved residues can be recognised and in a preferable embodiment 
10 may be present in the variant enzymes for use in the compositions and methods of the 
invention; 

Block 1 - GDSX block 
hid hid hid hid Gly Asp Ser hid 
15 28 29 30 31 32 33 34 35 

Block 2 - GANDY block 
hid Gly hid Asn Asp hid 
130 131 132 133 134 135 

20 

Block 3 - HPT block 

His 

309 

25 Where liid* means a hydrophobic residue selected from Met, De, Leu, Val, Ala, Gly, 
Cys, His, Lys, Trp, Tyr, Phe. 

Preferably the parent and/or variant lipid acyltransferase enzyme for use in the 
compositions/methods of the invention can be aligned using the Pfam00657 consensus 
30 sequence. 

Preferably, a positive match with the hidden markov model profile (HMM profile) of 
the pfam00657 domain family indicates the presence of the GDSL or GDSX domain 
according to the present invaation. 
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Preferably when aligned with the Pfam00657 consensus sequence the parent and/or 
variant lipid acyltransferase for use m the compositions/methods of the invention have 
at least one, preferably more than one, preferably more than two, of the following, a 
5 GDSx block, a GANDY block, a HPT block Suitably, the parent and/or variant lipid 
acyltransferase may have a GDSx block and a GANDY block. Alternatively, the 
parent and/or variant enzyme may have a GDSx block and a HPT block. Preferably the 
parent and/or variant enzyme comprises at least a GDSx block. 

10 Preferably, when aligned with the Pfam00657 consensus sequence the parent and/or 
variant enzyme for use in the compositions/methods of the invention have at least one, 
preferably more than one, preferably more than two, preferably more than three, 
preferably more than four, preferably more than five, preferably more than six, 
preferably more than seven, preferably more than eight, preferably more than nine, 

15 preferably more than ten, preferably more than eleven, preferably more than twelve, 
preferably more than thirteen, preferably more than fourteen, of the following amino 
acid residues vrfien compared to the reference A. liydrophilia polypeptide sequence, 
namely SEQ ID No. 26: 28hid, 29hid, 30hid, 31hid, 32gly, 33 Asp, 34Ser, 35hid, 
130hid, 131Gly, 132Hid, 133Asn, 134Asp, 135hid, 309His 

20 

The pfam00657 GDSX domain is a unique identifier which distinguishes proteins 
possessing this domain fi-om other enzymes. 

The pfam00657 consensus sequence is presented in Figure 1 as SEQ JD No. 1. This is 
25 derived fi-om the identification of the pfam family 00657, database version 6, which 
may also be referred to as pfam00657.6 herein. 

The consensus sequence may be updated by using fiirther releases of the pfam 
database. 

30 

For example, Figures 33 and 34 show the pfam alignment of family 00657, fi-om 
database version 1 1, which may also be referred to as pfamO0657.1 1 herein. 
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The presence of the GDSx, GANDY and HPT blocks are foxind in the pfam fanwly 
00657 from both releases of the database. Future releases of the pfam database can be 
used to identify the pfam family 00657, 

5 

Preferably, the parent lipid acyltransferase enzyme according to the present invention 
may be characterised using the following criteria: 

(i) the enzyme possesses acyl transferase activity which may be defined as 
ester transfer activity whereby the acyl part of an original ester bond of 

10 a lipid acyl donor is transferred to acyl acceptor to form a new ester; 

(ii) the enzyme comprises the amino acid sequence motif GDSX, wherein 
X is one or more of the following amino acid residues L, A, V, I, F, Y, 
H,Q,T,N,MorS.; 

(iii) the enzyme comprises His-309 or comprises a histidine residue at a 
15 position corresponding to His-309 in the Aeromonas hydrophila 

lipolytic enzyme shown in Figure 2 (SEQ ID No. 2 or SEQ ID No. 26). 


Preferably, the amino acid residue of the GDSX motif is L. 


20 In SEQ ID No. 26 the first 18 amino acid residues form a signal sequence. His-309 of 
the fiill length sequence, that is the protein including the signal sequence, equates to 
His-291 of the mature part of the protein, i.e. the sequence without the signal 
sequence. 


25 Preferably, the parent lipid acyltransferase enzyme according to the present invention 
comprises the following catalytic triad: Ser-16, Asp-116 and His-291 or comprises a 
serine residue, an aspartic acid residue and a histidine residue, respectively, at 
positions corresponding to Ser-16, Asp-1 16 and His-291 in the Aeromonas hydrophila 
Hpolytic enzyme shown in Figure 2 (SEQ ID No. 2) or at positions corresponding to 

30 Ser-34, Asp-134 and His-309 of the fiill length sequence shown in Figure 28 (SEQ ID 
No. 26). As stated above, in the sequence shown in SEQ ID No. 26 the first 1 8 amino 
acid residues form a signal sequence. Ser-34, Asp-134 and His-309 of the full length 


wo 2005/066347 


PCT/IB2004/004378 


62 


sequence, that is the protein including the signal sequence, equate to Ser-16, Asp-1 16 
and His-291 of the mature part of the protein, i.e. the sequence without the signal 
sequence. In the pfam00657 consensus sequence, as given in Figure 1 (SEQ ID No. 
1) the active site residues correspond to Ser-7, Asp-1 57 and His-348. 

5 

Preferably, the parent lipid acyltransferase enzyme according to the present invention 
may be characterised using the following criteria: 

(i) the enzyme possesses acyl transferase activity which may be defined as 
ester transfer activity whereby the acyl part of an original ester bond of 

10 a first lipid acyl donor is transferred to an acyl acceptor to form a new 

ester; and 

(ii) the enzyme comprises at least Gly-14, Asp-15, Ser-16, Asp-1 16 and 
His- 191 at positions corresponding to Aeromonas hydrophila enzyme in 
Figure 2 (SEQ ID No. 2) which is equivalent to positions Gly-32, Asp- 
15 33, Ser-34, Asp-134 and His-309, respectively, in Figure 28 (SEQ ID 

No. 26). 


Suitably, the parent lipid acyltransferase enzyme according to the present invention 
may be obtainable, preferably obtained, firom organisms from one or more of the 
20 following genera: Aeromonas^ Corynebacterium, Novosphingobium, Termobifida, 
Streptomyces, Saccharomyces, Lactococcus, Mycobacterium, Streptococcus, 
Lactobacillus, Desuljitobacterium, Bacillus, Campylobacter, Vibrionaceae, Xylella, 
Sulfolobus, Aspergillus, Schizosaccharomyces, Listeria, Neisseria, Mesorhizobium, 
Ralstonia, Xanthomonas and Candida, 

25 

Suitably, the parent lipid acyltransferase enzyme according to the present invention 
may be obtainable, preferably obtained, from one or more of the following organisms: 
Aeromonas hydrophila^ Aeromonas salmonicida, Streptomyces coelicolor, 
Streptomyces rimosus, Mycobacterium, Streptococcus pyogenes, Lactococcus lactis, 
30 Streptococcus pyogenes. Streptococcus thermophilus, Lactobacillus helveticus, 
Desulfitobacterium dehalogenans. Bacillus sp, Campylobacter jejuni, Vibrionaceae^ 
Xylella fastidiosa, Sulfolobus solfataricus, Saccharomyces cerevisiae, Aspergillus 
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terretis, Schizosaccharomyces pombe. Listeria innocua. Listeria monocytogenes. 
Neisseria meningitidis, Mesorhizobitm loti, Ralstonia solanacearum, Xanthomonas 
campestris, Xanthomonas axonopodis, Corynebacterium efficens, Novosphingobium 
aromaiicivorans, Termobifida fiisca and Candida parapsHosis, 

5 

In one aspect, preferably the parent lipid acyltransferase enzyme according to the 
present invention is obtainable, preferably obtained, from one or more of Aeromonas 
hydrophila or Aeromonas salmonicida, 

10 In one aspect, the parent lipid acyltransferase according to the present invention may 
be a lecithinxholesterol acyltransferases (LCAT) or variant thereof (for example a 
variant made by molecular evolution) 

Suitable LCATs are known in the art and may be obtainable from one or more of the 
15 following organisms for example: mammals, rat, mice, chickens, Drosophila 
melanogaster^ plants, including Arabidopsis and Oryza satTva, nematodes, fimgi and 
yeast. 

Preferably, when carrying out a method according to the present invention the product 
20 (i.e. foodstuff) is produced without increasing or substantially increasing the free fatty 
acids in the foodstuff. 

The term '^ansferase" as used herein is interchangeable with the term "lipid 
acyltransferase". 

25 

The terai "galactolipid transferase activity" as used herein means the ability of the 
enzyme to catalyse the transfer of an acyl group from a galactolipid donor to an 
acceptor molecule (other than water), such as glycerol for instance. 

30 Likewise, the term "phospholipids transferase activity" as used herein means the 
ability of the enzyme to catalyse the transfer of an acyl gro\q) from a phospholipids 
donor to an acceptor molecule (other than water), such as glycerol for instance. 
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The tenn "an increased ratio of galactolipase transferase activity compared with 
phospholipid transferase activity" as used herein means the variant enzyme when 
compared with the parent enzyme is able to catalyse galactolipid transferase at a higher 
rate compared with phospholipid transferase. This may mean that both galactolipid 
transferase activity and phospholipid transferase activity are increased compared with 
the parent enzyme or that galactolipid transferase activity is increased whilst 
phospholipid transferase activity is decreased compared with the parent enzyme. It is 
the final relation between the two activities which is important 

Suitably, the lipid acyltransferase as defined herein catalyses one or more of the 
following reactions: mteresterification, transesterification, alcoholysis, hydrolysis. 

The term "interesterification" refers to the enzymatic catalysed transfer of acyl groups 
between a lipid donor and Upid acceptor, wherein the lipid donor is not a fi-ee acyl 
group. 

The term "transesterification" as used herein means the enzymatic catalysed transfer of 
an acyl group fi-om a lipid donor (other than a fi-ee fatty acid) to an acyl acceptor (other 
than water). 

As used herein, the term "alcoholysis" refers to the enzymatic cleavage of a covalent 
bond of an acid derivative by reaction with an alcohol ROH so that one of the products 
combines with the H of the alcohol and the other product combines with the OR group 
of the alcohol. 

As used herein, the term "alcohol" refers to an alkyl compound containing a hydroxyl 
group. 

As used herein, the term "hydrolysis" refers to the enzymatic catalysed transfer of an 
acyl group firom a lipid to the OH group of a water molecule. Acyl transfer which 
results firom hydrolysis requires the separation of the water molecule. 
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The term "galactolipid hydrolytic activity" as used herein means the the ability of the 
enzyme to catalyse the hydrolysis of a galactolipid by transferring an acyl group from 
the galactolipid to the OH group of a water molecule. 

5 

Similarly, the term "phospholipid hydrolytic activity" as used herein means the the 
ability of the enzyme to catalyse the hydrolysis of a phospholipid by transferring an 
acyl group from the phospholipid to the OH group of a water molecule. 

10 The term "an increased ratio of galactolipase transferase activity compared with 
galacolipid hydrolysis activity" as used herein means the variant enzyme when 
compared with the parent enzyme is able to catalyse galactolipid transferase at a higher 
rate compared with galactolipid hydrolysis. This may mean that both galactolipid 
transferase activity and galactolipid hydrolysis activity are increased compared with 

15 the parent enzyme or that galactolipid transferase activity is increased whilst 
galactolipid hydrolysis activity is decreased compared with the parent enzyme. It is 
the final relation between the two activities which is important. 

The term 'Svithout increasing or without substantially increasing the free fatty acids" 
20 as used herein means that preferably the lipid acyl transferase according to the present 
invention has 100% transferase activity (i.e. transfers 100% of the acyl groups from an 
acyl donor onto the acyl acceptor, with no hydrolytic activity); however, the enzyme 
may transfer less than 100% of the acyl groups present in the lipid acyl donor to the 
acyl acceptor. In which case, preferably the acyltransferase activity accounts for at 
25 least 5%, more preferably at least 10%, more preferably at least 20%, more preferably 
at least 30%, more preferably at least 40%, more preferably 50%, more preferably at 
least 60%, more preferably at least 70%, more preferably at least 80%, more 
preferably at least 90% and more preferably at least 98% of the total enzyme activity. 
The % transferase activity (i.e. the transferase activity as a percentage of the total 
30 enzymatic activity) may be determined by the following protocol: 

Protocol for the determination of% acyltransferase activity: 
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A foodstuff to which a lipid acyltransferase according to the present invention has 
been added may be extracted following the enzymatic reaction with CHClaiCHaOH 
2:1 and the organic phase containing the lipid material is isolated and analysed by 
5 GLC according to the procedxire detailed hereinbelow. From the GLC analysis (and if 
necessary HPLC analysis) the amount of free fatty acids and one or more of 
sterol/stanol esters; carbohydrate esters, protein esters; diglycerides; or 
monoglycerides are determined. A control foodstuff to which no enzyme according to 
the present invention has been added, is analysed in the same way. 
10 Calculation: 

From the results of the GLC (and optionally HPLC analyses) the increase in free fatty 
acids and sterol/stanol esters and/or carbohydrate esters and/or protein esters and/or 
diglycerides and/or monoglycerides can be calculated: 

A % fatty acid = % Fatty acid(enzyme) - % fatty acid(control); Mv fatty acid == 
1 5 average molecular weight of the fatty acids; 

A = A % sterol ester/Mv sterol ester (where A % sterol ester = % sterol/stanol 
ester(enzyme) - % sterol/stanol ester(control) and Mv sterol ester = average molecular 
weight of the sterol/stanol esters) - applicable where the acyl acceptor is a sterol 
and/or stanol; 

20 B = A % carbohydrate ester/Mv carbohydrate ester (where A % carbohydrate ester = % 
carbohydrate ester(enzyme) - % carbohydrate ester(control) and Mv carbohydrate ester 
= average molecular weight of the carbohydrate ester) - appUcable where the acyl 
acceptor is a carbohydrate; 

C = A % protein ester/Mv protein ester (where A % protein ester = % protein 
25 ester(enzyme) - % protein ester(control) and Mv protein ester = average molecular 
weight of the protein ester) - applicable where the acyl acceptor is a protein; and 
D = absolute value of diglyceride and/or monoglyceride/Mv di/monoglyceride (where 
A% diglyceride and/or monoglyceride = % diglyceride and/or monoglyceride 
(enzyme) - % diglyceride and/or monoglyceride (control) and Mv di/monoglyceride = 
30 average molecular weight of the diglyceride and/or monoglyceride) - applicable v\4iere 
the acyl acceptor is glycerol. 


1 
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The transferase activity is calculated as a percentage of the total enzymatic activity; 

% transferase activity = A* + B* + C*-fD* x 100 , 

5 A* + B* + C* + D*+A % fatty acid/(Mv fatty acid) 

* - delete as appropriate. 

The amino acids which fall within the terms "non-polar", '*polar - uncharged", "polar 
- charged" are given in the table below, as are the amino acids falling within the terms 
10 "aliphatic" and "aromatic". The term "polar"* refers to both "polar - uncharged" and 
"polar - charged" amino acids. 


I ALIPHATIC 

Non-polar 

GAP 



ILV 


Polar — uncharged 

CSTM 



NQ 


Polar -charged 

DE 



KR 

AROMATIC 


HFWY 


15 GLC analysis 

Perkin Elmer Autosystem 9000 Capillary Gas Chromatograph equipped with WCOT 
fused silica column 12.5 m x 0.25 mm ID x 0.1 \i fihn thickness 5% phenyl-methyl- 
silicone (CP Sil 8 CB from Chrompack). 
20 Carrier gas: Helium. 

Injector. PSSI cold split injection (initial temp SO^'C heated to SSS^^C), volume i,0\il 
Detector FID: 395X 


Oven program: 

1 

2 

3 

Oven temperature, °C. 

90 

280 

350 

25 Isothermal, time, min. 

1 

0 

10 
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Temperature rate, "^Cymin. 15 4 

Sample preparation: 30 mg of sample was dissolved in 9 ml HeptaneiPyridin, 2:1 
containing internal standard heptadecane, 0.5 mg/ml. SOOjxl sample solution was 
5 transferred to a crimp vial, 300 \il MSTFA (N-Methyl-N-trimethylsilyl- 
trifluoraceamid) was added and reacted for 20 minutes at 60°C. 
Calculation: Response factors for mono-di-triglycerides and free fatty acid were 
determined from Standard 2 (mono-di-triglyceride), for Cholesterol, Cholesteryl 
palmitate and Cholesteryl stearate the response factors were determined from pure 
10 reference material (weighing for pure material lOmg). 

TECHNICAL EFFECTS 

The present invention may provide one or more of the following unexpected technical 
15 effects in egg products, particularly mayonnaise: an improved heat stability during 
pasteurisation; improved organoleptic properties, an improved consistency. 

Variant enzymes with increased phospholipid transferase activity, particularly with 
increased tranferase activity between a phospholipid and a sterol and/or stanol, such as 
20 cholesterol, may be particularly useftd in methods for producing lysophospholipids 
and/or for enzymatic degumming of edible oils and/or for the production of egg 
products with improved emulsification properties and/or health benefits. 

For use in methods of enzymatic degumming, variants with an increased absolute 
25 phospholipid transferase to sterol activity are preferred. 

Suitably, the present invention may provide one or more of the following unexpected 
technical effects in egg or in egg products: improved stability of emulsion; thermal 
stability of emulsion; improved flavour; reduced mal-odour; improved thickening 
30 properties, improved consistency. 
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The present invention may provide one or more of the following unexpected technical 
effects in dough and/or baked products: an improved specific volume of either the 
dough or the baked products (for example of bread and/or of cake); an improved 
dough stability; an improved crust score (for example a thinner and/or crispier bread 
5 crust), an improved crumb score (for example a more homogenous crumb distribution 
and/or a finer crumb structure and/or a softer crumb); an improved appearance (for 
example a smooth surface vvithout blisters or holes or substantially without blisters or 
holes); a reduced staling; an enhanced softness; an improved odour; an improved taste, 

10 Suitably, the present invention may provide one or more of the following unexpected 
technical effects in a foodstuff: an improved appearance, an improved mouthfeel, an 
improved stability, in particular an improved themial stability, an improved taste, an 
unproved softness, an improved resilience, an improved emulsification. 

15 Suitably, the present invention may provide one or more of the following unexpected 
technical effects in dairy products, such as ice cream for example: an improved 
mouthfeel (preferably a more creamy mouthfeel); an improved taste; an improved 
meltdown. 

20 Specific technical effects associated with the use of a lipid acyltransferase as defined 
herein in the preparation of a foodstuff are listed in the table below: 



Foodstuff 

Effect 

1 

Bread, Muffins and 
Doughnuts 

Strengthens dough and increases mechanical 
resistance and increases water absorption capacity 
and/or increases volimie of bakery products and 
maintains softness of cnmib and/or reduces 
blisters on the bread surface. 

2 

Frozen dough 

Prevents spoiling during refid geration 

3 

Sponge cake 

Makes good cake volume and/or a uniform soft 
texture 

4 

Biscmt, cracker and 
cookie 

Makes stable emulsions of fat and/or prevCTts 
stickiness to the machine and/or prevents 
blooming of high fat products 

5 

Batter and breading 

Improves texture of fried products. 

6 

Noodles 

Prevents dough firom sticking to the madiine 
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and/or increases water content, and/or decreases 
cooking loss 

7 

Instant noodles 

Prevent noodles form adhering to each other 

8 

Pasta 

Dough conditioner prevents adhesion on 
cooking. 

9 

Custard cream 

Makes starch paste with a smooth and creamy 
texture, and/or prevents dehydration. 

10 

Coffee whitener 

Prevent oil and water separation 

11 

Whipping cream 

Provides stable emulsion 

12 

Chocolate 

Prevents or reduced blooming 

13 

Caramel, candy and 
nougat 

Improves emulsification of molten sugar and oil 
and/or prevents separation of oil. 

1 
4 

Processed meat, 
saiisages 

Improves water holding capacity of sausages and 
pressed ham, and/or prevents separation of oil 
phase of pastes and pate. 


Suitably, the present invention may provide one or more of the following xmexpected 
technical eJBFects in cheese: a decrease in the oiling-oflF effect in cheese; an increase in 
cheese yield; an improvement in flavour; a reduced mal-odour; a reduced "soapy" 
5 taste. 


In one aspect, the present invention is based in part on the realisation that yields of 
foods - such as cheese - may be improved by the use of a lipid acyl transferase. In 
addition or alternatively, the flavour, texture, oxidative stability and/or shelf life of the 
10 food may be improved. In addition or altematively, the food may have a reduced 
cholesterol level or enhanced content of phytosterol/stanol esters. 

The present invention in one aspect may provide a food additive composition 
comprising a lipid acyl transferase as defined herein. 

15 

The present invention may in another aspect provide a cosmetic composition 
comprising a lipid acyl transferase as defined herein. 


20 


In addition, the present invention may provide the use of an acyltransferase as defined 
herein to produce a cosmetic composition. 
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ADVANTAGES 

Variant transferases of the present invention have one or more of the following 
advantageous properties compared with the parent enzyme: 

5 

i) an increased activity on polar lipids and/or an increased activity on polar lipids 
compared to triglycerides. 

ii) an increased activity on galactolipids (glycolipids), such as one or more of 
10 digalactosyl diglyceride (DGDG) and/or monogalactosyl diglyceride (MGDG). 

iii) an increased ratio of activity on galactolipids (glycolipids) compared to either 
phospholipids and/or triglycerides 

15 Preferably variant transferases of the invention have increased activity on digalactosyl 
diglyceride (DGDG) and/or monogalactosyl diglyceride (MGDG). 

Preferably variant transferases of the present invention has increased activity on 
DGDG and/or MGDG and decreased activity on DGMG and/or MGMG. 

20 

The variant transferases of the invention may also have an increased activity on 
triglycerides. 

The variant transferases of the invention may also have an increased activity on 
25 phospholipids, such as lecithin, including phosphatidyl choline. 

Variant transferases of the present invention may have decreased activity on 
triglycerides, and/or monoglycerides and/or diglycerides. 

30 The term polar lipid refers to the polar lipids usually foimd in a dough, preferably 
galactolipids and phospholipids. 
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When used in preparation of a dough or baked product the variant transferase of the 
invention may result in one or more of the following unexpected technical ejffects in 
dough and/or baked products: an improved specijBc volume of either the dough or the 
baked products (for example of bread and/or of cake); an improved dough stabihty; an 
improved crust score (for example a thinner and/or crispier bread crust), an improved 
crumb score (for example a more homogenous crumb distribution and/or a finer crumb 
structure and/or a softer crumb); an improved appearance (for example a smooth 
surface without blisters or holes or substantially without blisters or holes); a reduced 
staling; an enhanced softness; an improved odour, an improved taste. 

ISOLATED 


hi one aspect, preferably the polypeptide or protein for use in the present invention is 
in an isolated form. The term "isolated" means that the sequence is at least 
15 substantially free from at least one other component with which the sequence is 
naturally associated in nature and as found in nature. 

PURIFIED 

20 In one aspect, prefembly the polypeptide or protein for use in the present invention is 
in a purified form. The term "purified" means that the sequence is in a relatively pure 
state - e.g. at least about 51% pure, or at least about 75%, or at least about 80%, or at 
least about 90% pure, or at least about 95% pure or at least about 98% pure. 


25 CLONING A NUCLEOTIDE SEQUENCE ENCODING A POLYPEPTIDE 
ACCORDING TO THE PRESENT INVENTION 

A nucleotide sequence encoding either a polypeptide which has the specific properties 
as defined herein or a polypeptide which is suitable for modification may be isolated 
30 from any cell or organism producing said polypeptide. Various methods are well 
known within the art for the isolation of nucleotide sequences. 
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For example, a genomic DNA and/or cDNA library may be constructed using 
chromosomal DNA or messenger RNA from the organism producing the polypeptide. 
If the amino acid sequence of the polypeptide is known, labelled oligonucleotide 
5 probes may be synthesised and used to identify polypeptide-encoding clones from the 
genomic library prepared from the organism. Alternatively, a labelled oligonucleotide 
probe containing sequences homologous to another known polypeptide gene could be 
used to identify polypeptide-encoding clones. In the latter case, hybridisation and 
washing conditions of lower stringency are used. 

10 

Alternatively, polypeptide-encoding clones could be identified by inserting fragments 
of genomic DNA into an expression vector, such as a plasmid, transforming enzyme- 
negative bacteria with the resulting genomic DNA library, and then plating the 
transformed bacteria onto agar containing an enzyme inhibited by the polypeptide, 
15 thereby allowing clones expressing the polypeptide to be identified. 

In a yet further alternative, the nucleotide sequence encoding the polypeptide may be 
prepared synthetically by established standard methods, e.g. the phosphoroamidite 
method described by Beucage S.L. et al (1981) Tetrahedron Letters 22, p 1859-1869, 
20 or the method described by Matthes et al (1984) EMBO J. 3, p 801-805. Li the 
phosphoroamidite method, oligonucleotides are synthesised, e.g. in an automatic DNA 
synthesiser, purified, annealed, ligated and cloned in appropriate vectors. 

The nucleotide sequence may be of mixed genomic and synthetic origin, mixed 
25 synthetic and cDNA origin, or mixed genomic and cDNA origin, prepared by ligating 
fragments of synthetic, genomic or cDNA origin (as appropriate) in accordance with 
standard techniques. Each ligated fragment corresponds to various parts of the entire 
nucleotide sequence. The DNA sequence may also be prepared by polymerase chain 
reaction (PGR) using specific primers, for instance as described in US 4,683,202 or in 
30 Saiki R K e/ fl/ (Science (1988) 239, pp 487-491). 
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NUCLEOTIDE SEQUENCES 

The present invention also encompasses nucleotide sequences encoding polypeptides 
having the specific properties as defined herein. The term "nucleotide sequence** as used 
5 herein refers to an oligonucleotide sequence or polynucleotide sequence, and variant, 
homologues, firagments and derivatives thereof (such as portions thereof. The nucleotide 
sequence may be of genomic or synthetic or recombinant origin, which may be double- 
stranded or single-stranded whether representing the sense or antisense strand. 

10 The term "nucleotide sequence" in relation to the present mvention includes genomic 
DNA, cDNA, synthetic DNA, and RNA. Preferably it means DNA, more preferably 
cDNA for the coding sequence. 

In a preferred embodiment, the nucleotide sequence per se encoding a polypeptide having 
15 the specific properties as defined herein does not cover the native nucleotide sequence in 
its natural environment vAien it is Unked to its naturally associated sequence(s) that is/are 
also in its/their natural environment For ease of reference, we shall call this preferred 
embodiment the "non-native nucleotide sequence". In this regard, the term "native 
nucleotide sequence" means an entire nucleotide sequence that is in its native 
20 environment and when operatively linked to an entire promoter with which it is naturally 
associated, wiiich promoter is also in its native environment Thus, the polypeptide of the 
present invention can be expressed by a nucleotide sequence in its native organism but 
wherein the nucleotide sequence is not under the control of the promoter with which it is 
naturally associated within that organism. 

25 

Preferably the polypeptide is not a native polypeptide. In this regard, the term "native 
polypeptide" means an entire polypeptide that is in its native environment and when it has 
been expressied by its native nucleotide sequence. 

30 Typically, the nucleotide sequence encoding polypeptides having the specific 
properties as defined herein is prepared using recombinant DNA techniques (i.e. 
recombinant DNA). However, in an alternative embodiment of the mvention, the 
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nucleotide sequence could be synthesised, in whole or in part, using chemical methods 
well known in the art (see Caruthers MH et al (1980) Nuc Acids Res Symp Ser 215-23 
and Hom T et al (1980) Nuc Acids Res Symp Ser 225-232). 

MOLECULAR EVOLUTION 

Once an enzyme-encoding nucleotide sequence has been isolated, or a putative 
enzyme-encoding nucleotide sequence has been identified, it may be desirable to 
modify the selected nucleotide sequence, for example it may be desirable to mutate the 
sequence in order to prepare an enzyme in accordance with the present invention. 

Mutations may be introduced using synthetic oligonucleotides. These oligonucleotides 
contain nucleotide sequences flanking the desired mutation sites. 

A suitable method is disclosed in Morinaga et al (Biotechnology (1984) 2, p646-649). 
Another method of introducing mutations into enzyme-encoding nucleotide sequences 
is described in Nelson and Long (Analytical Biochemistry (1989), 180, p 147-151). 

Instead of site directed mutagenesis, such as described above, one can introduce 
mutations randomly for instance using a commercial kit such as the GeneMorph PGR 
mutagenesis kit firom Stratagene, or the Diversify PGR random mutagenesis kit fi-om 
Clontech. EP 0 583 265 refers to methods of optimising PGR based mutagenesis, 
which can also be combined v^th the use of mutagenic DNA analogues such as those 
described in EP 0 866 796. Error prone PGR technologies are suitable for the 
production of variants of lipid acyl transferases with preferred, characterisitics. 
WO0206457 refers to molecular evolution of lipases. 

A third method to obtain novel sequences is to fragment non-identical nucleotide 
sequences, either by using any number of restriction enzymes or an enzyme such as 
Dnase I, and reassembling full nucleotide sequences coding for fimctional proteins. 
Alternatively one can use one or multiple non-identical nucleotide sequences and 
introduce mutations during the reassembly of the fiill nucleotide sequence. DNA 
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shuffling and family shuffling technologies are suitable for the production of variants 
of lipid acyl transferases with preferred characteristics. Suitable methods for 
performing 'shuffling' can be found in EPO 752 008, EPl 138 763, EPl 103 606. 
Shuflaing can also be combined with other forms of DNA mutagenesis as described in 
5 US 6,180,406 and WO 01/34835. 

Thus, it is possible to produce numerous site directed or random mutations into a 
nucleotide sequence, either in vivo or in vitro^ and to subsequently screen for improved 
fimctionahty of the encoded polypeptide by various means. Using in silico and exo 
10 mediated recombination methods (see WO 00/58517, US 6,344,328, US 6,361,974), 
for example, molecular evolution can be performed where the variant produced retains 
very low homology to known enzymes or proteins. Such variants thereby obtained 
may have significant structural analogy to known transferase enzymes, but have very 
low amino acid sequence homology. 

15 

As a non-limiting example. In addition, mutations or natural variants of a 
polynucleotide sequence can be recombined with either the wild type or other 
mutations or natural variants to produce new variants. Such new variants can also be 
screened for improved functionality of the encoded polypeptide. 

20 

The application of the above-mentioned and similar molecular evolution methods 
allows the identification and selection of variants of the enzymes of the present 
invention which have preferred characteristics without any prior knowledge of protein 
structure or fimction, and allows the production of non-predictable but beneficial 

25 mutations or variants. There are numerous examples of the appUcation of molecular 
evolution in the art for the optimisation or alteration of enzyme activity, such examples 
include, but are not limited to one or more of the following: optimised expression 
and/or activity in a host cell or in vitro, increased enzymatic activity, altered substrate 
and/or product specificity, increased or decreased enzymatic or structural stability, 

30 altered CTzymatic activity/specificity in preferred environmental conditions, e.g. 
temperature, pH, substrate 
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As will be apparent to a person skilled in the art, using molecular evolution tools an 
en2yme may be altered to improve the functionality of the enzyme. 

Suitably, the lipid acyltransferase used in the invention may be a variant, i.e, may 
5 contain at least one amino acid substitution, deletion or addition, when compared to a 
parental enzyme. Variant enzymes retain at least 25%, 30%, 40%, 50 %, 60%, 70%, 
80%, 90%, 95%, 97%, 99% homology vwth the parent enzyme. Suitable parent 
enzymes may include any enzyme with esterase or lipase activity. Preferably, the 
parent enzyme aligns to the pfam00657 consensus sequence. 

10 

In a preferable embodiment a variant lipid acyltransferase enzyme retains or 
incorporates at least one or more of the pfam00657 consensus sequence amino acid 
residues found in the GDSx, GANDY and HPT blocks. 

15 Enzymes, such as lipases with no or low lipid acyltransferase activity in an aqueous 
environment may be mutated using molecular evolution tools to introduce or enhance 
the transferase activity, thereby producing a lipid acyltransferase enzyme with 
significant transferase activity suitable for use in the compositions and methods of the 
present invention. 

20 

Suitably, the lipid acyltransferase for use in the invention may be a variant with 
enhanced enzyme activity on polar lipids, preferably phospholipids and/or glycolipids 
when compared to the parent enzyme. Preferably, such variants also have low or no 
activity on lyso polar lipids. The enhanced activity on polar lipids, phospholipids 
25 and/or glycolipids may be the result of hydrolysis and/or transferase activity or a 
combination of both. 

Variant lipid acyltransferases for use in the invention may have decreased activity on 
triglycerides, and/or monoglycerides and/or diglycerides compared with the parent 
30 enzyme. 
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Suitably the variant enzyme may have no activity on triglycerides and/or 
monoglycerides and/or diglycerides. 

Alternatively, the variant en2yme for use in the invention may have increased activity 
5 on triglycerides, and/or may also have increased activity on one or more of the 
following, polar lipids, phospholipids, lecithin, phosphatidylcholine, glycolipids, 
digalactosyl monoglyceride, monogalactosyl monoglyceride. 

Variants of lipid acyltransferases are known, and one or more of such variants may be 
10 suitable for use in the methods and uses according to the present invention and/or in 
the enzyme compositions according to the present invention. By way of example only, 
variants of lipid acyltransferases are described in the following references may be used 
in accordance with the present invention: Hilton & Buckley J Biol. Chem. 1991 Jan 
15: 266 (2): 997-1000; Robertson et al J. Biol. Chem. 1994 Jan 21; 269(3):2146-50; 
15 Bnmilik et al J. Bacteriol 1996 Apr; 178 (7): 2060-4; Peelman et al Protein Sci. 1998 
Mar; 7(3):587-99. 

AMINO ACID SEQUENCES 

20 The present invention also encompasses amino acid sequences of polypeptides having 
the specific properties as defined herein. 

As used herein, the term "amino acid sequence" is synonymous with the term 
"polypeptide" and/or the term ''protein". In some instances, the term "amino acid 
25 sequence" is synonymous with the term "peptide". 

The amino acid sequence may be prepared^solated from a suitable source, or it may be 
made synthetically or it may be prepared by use of recombinant DNA techniques. 

30 Suitably, the amino acid sequences may be obtained from the isolated polypeptides 
taught herein by standard techniques. 
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One suitable method for detenniniiig amino acid sequences from isolated polypeptides 
is as follows: 

Purified polypeptide may be freeze-dried and 100 \ig of the freeze-dried material may 
5 be dissolved in 50 \il of a mixture of 8 M urea and 0.4 M ammonium hydrogen 
carbonate, pH 8.4. The dissolved protein may be denatured and reduced for 1 5 minutes 
at 50°C following overlay with nitrogen and addition of 5 nl of 45 mM dithiothreitoL 
After cooling to room temperature, 5 ^l of 100 mM iodoacetamide may be added for 
the cysteine residues to be derivatized for 15 minutes at room temperature in the dark 
10 under nitrogen. 

135 fil of water and 5 \ig of endoproteinase Lys-C in 5 |il of water may be added to the 
above reaction mixture and the digestion may be carried out at 37°C under nitrogen for 
24 hours. 

15 

The resulting peptides may be separated by reverse phase HPLC on a VYDAC CI 8 
colunm (0.46x1 5cm;10^m; The Separation Group, California, USA) using solvent A: 
0. 1% TFA in water and solvent B: 0.1% TFA in acetonitrile. Selected peptides may be 
re-chromatographed on a Develosil CI 8 column using the same solvent system, prior 
20 to N-terminal sequencing. Sequencing may be done using an Apphed Biosystems 
476A sequencer using pulsed Uquid fast cycles according to the manufecturer's 
instructions (Applied Biosystems, California, USA). 

25 SEQUENCE IDENTITY OR SEQUENCE HOMOLOGY 

The present invention also encompasses the use of sequences having a degree of 
sequence identity or sequence homology with amino acid sequence(s) of a polypeptide 
having the specific properties defined herein or of any nucleotide sequence encoding 
30 such a polypeptide (hereinafter referred to as a "homologous sequence(s)")- Here, the 
term "homologue" means an entity having a certain homology with the subject amino 
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acid sequences and the subject nucleotide sequences. Here, the term "homology" can 
be equated with "identity". 

The homologous amino acid sequence and/or nucleotide sequence should provide 
5 and/or encode a polypeptide which retains the functional activity and/or enhances the 
activity of the enzyme. 

In the present context, a homologous sequence is taken to include an amino acid 
sequence which may be at least 75, 85 or 90% identical, preferably at least 95 or 98% 
10 identical to the subject sequence. Typically, the homologues will comprise the same 
active sites etc. as the subject amino acid sequence. Although homology can also be 
considered in terms of similarity (i.e. amino acid residues having similar chemical 
properties/ftmctions), in the context of the present invention it is preferred to express 
homology in tenns of sequence identity. 

15 

In the present context, a homologous sequence is taken to include a nucleotide 
sequence which may be at least 75, 85 or 90% identical, preferably at least 95 or 98% 
identical to a nucleotide sequence encoding a polypeptide of the present invention (the 
subject sequence). Typically, the homologues will comprise the same sequences that 
20 code for the active sites etc. as the subject sequence. Although homology can also be 
considered in terms of similarity (i.e. amino acid residues having similar chemical 
properties/functions), in the context of the present invention it is preferred to express 
homology in terms of sequence identity. 

25 Homology comparisons can be conducted by eye, or more usually, with the aid of 
readily available sequence comparison programs. These commercially available 
computer programs can calculate % homology between two or more sequences. 

% homology may be calculated over contiguous sequences, i.e. one sequence is 
30 aligned with the other sequence and each amino acid in one sequence is directly 
compared with the corresponding amino acid in the other sequence, one residue at a 
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time. This is called an "ungapped" alignment Typically, such ungapped alignments 
are performed only over a relatively short number of residues. 

Although this is a very simple and consistent method, it fails to take into consideration 
5 that, for example, in an otherwise identical pair of sequences, one iusertion or deletion 
will cause the following amino acid residues to be put out of alignment, thus 
potentially resulting in a large reduction in % homology when a global alignment is 
performed. Consequently, most sequence comparison methods are designed to 
produce optimal ahgrunents that take into consideration possible insertions and 
10 deletions without penalising imduly the overall homology score. This is achieved by 
inserting "gaps" in the sequence alignment to try to maximise local homology. 

However, these more complex methods assign "gap penalties" to each gap that occurs 
in the alignment so that, for the same number of identical amino acids, a sequence 

15 alignment with as few gaps as possible - reflecting higher relatedness between the two 
compared sequences - will achieve a higher score than one with many gaps. "AfBne 
gap costs" are typically used that charge a relatively high cost for the existence of a 
gap and a smaller penalty for each subsequent residue in the gap. This is the most 
commonly used gap scoring system. High gap penalties will of coin-se produce 

20 optimised alignments with fewer gaps. Most alignment programs allow the gap 
penalties to be modified. However, it is preferred to use the default values using 
such software for sequence comparisons. For example when using the GCG 
Wisconsin Bestfit package the default gap penalty for amino acid sequences is -12 for 
a gap and -4 for each extension. 

25 

Calculation of maximum % homology therefore jBrsdy requires the production of an 
optimal alignment, taking into consideration gap penalties. A suitable computer 
program for carrying out such an alignment is the GCG Wisconsin Bestfit package 
(Devereux et al 1984 Nuc. Acids Research 12 p387). Examples of other softvrare that 
30 can perform sequence comparisons include, but are not limited to, the BLAST package 
(see Ausubel et al 1999 Short Protocols in Molecular Biology, 4* Ed - Chapter 18), 
FASTA (Altschul et al 1990 J. Mol. Biol. 403-410) and the GENEWORKS suite of 
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comparison tools. Both BLAST and FASTA are avaUable for ofQine and online 
searching (see Ausubel et al 1999, pages 7-58 to 7-60). However, for some 
applications, it is preferred to use the GCG Bestfit program. A new tool, called 
BLAST 2 Sequences is also available for comparing protein and nucleotide sequence 
5 (see FEMS Microbiol Lett 1999 174(2): 247-50; FEMS Microbiol Lett 1999 177(1): 
1 87-8 and tatiana@ncbi.nlni.nili.gov). 

Although the final % homology can be measured in terms of identity, the alignment 
process itself is typically not based on an all-or-nothing pair comparison. Instead, a 
10 scaled similarity score matrix is generally used that assigns scores to each pairwise 
comparison based on chemical similarity or evolutionary distance. An example of 
such a matrix commonly used is the BLOSUM62 matrix - the default matrix for the 
BLAST suite of programs. GCG Wisconsin programs generally use either the public 
default values or a custom symbol comparison table if supplied (see user manual for 
15 further details). For some applications, it is preferred to use the public default values 
for the GCG package, or in the case of other software, the defauh matrix, such as 
BLOSUM62. 

Alternatively, percentage homologies may be calculated using the multiple alignment 
feature in DNASIS™ (Hitachi Software), based on an algorithm, analogous to 
CLUSTAL (Higgins DG & Sharp PM (1988), Gene 73(1), 237-244). 

Once the software has produced an optimal alignment, it is possible to calculate % 
homology, preferably % sequence identity. The software typically does this as part of 
the sequence comparison and generates a numerical result 

The sequences may also have deletions, insertions or substitutions of amino acid 
residues vrfiich produce a silent change and result in a fimctionally equivalent 
substance. Deliberate amino acid substitutions may be made on the basis of similarity 
in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues as long as the secondary binding activity of the substance is 
retained. For example, negatively charged amino acids include aspartic acid and 
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glutamic acid; positively charged amino acids include lysine and arginine; and amino 
acids with uncharged polar head groups having similar hydrophilicity values include 
leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, serine, threonine, 
phenylalanine, and tyrosine. 

Conservative substitutions may be made, for example according to the Table below. 
Amino acids in the same block in the second column and preferably in the same line in 
the third colimm may be substituted for each other: 


ALIPHATIC 

Non-polar 

GAP 

ILV 

Polar - uncharged 

CSTM 

NQ 

Polar— charged 

DE 

KR 

AROMATIC 


HFWY 


The present invention also encompasses homologous substitution (substitution and 
replacement are both used herein to mean the interchange of an existing amino acid 
residue, with an alternative residue) that may occur i.e. like-for-like substitution such 
as basic for basic, acidic for acidic, polar for polar etc. Non-homologous substitution 
may also occur i.e. from one class of residue to another or alternatively involving the 
inclusion of unnatural amino acids such as ornithine (hereinafter referred to as Z), 
diaminobutyric acid ornithine (hereinafter referred to as B), norleucine ornithine 
(hereinafter referred to as O), pyriylalanine, thienylalanine, naphthylalanine and 
phenylglycine. 


Replacements may also be made by uimatural amino acids. 
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Variant amino acid sequences may include suitable spacer groups that may be inserted 
between any two amino acid residues of the sequence including alkyl groups such as 
methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or p- 
alanine residues. A further form of variation, involves the presence of one or more 
5 amino acid residues in peptoid form, will be well understood by those skilled in the art. 
For the avoidance of doubt, *^e peptoid form" is used to refer to variant amino acid 
residues wherein the a-carbon substituent group is on the residue's nitrogen atom 
rather than the a-carbon. Processes for preparing peptides in the peptoid form are 
known in the art, for example Simon RJ et al., PNAS (1992) 89(20), 9367-9371 and 
10 Horwell DC, Trends Biotechnol. (1995) 13(4), 132-134. 

Nucleotide sequences for use in the present invention or encoding a polypeptide 
having the specific properties defined herein may include within them synthetic or 
modified nucleotides. A number of different types of modification to oligonucleotides 

15 are known in the art. These include methylphosphonate and phosphorothioate 
backbones and/or the addition of acridine or polylysine chains at the 3* and/or 5' ends 
of the molecule. For the ptuposes of the present invention, it is to be understood that 
the nucleotide sequences described hereiD may be modified by any method available in 
the art. Such modifications may be carried out in order to enhance the in vivo activity 

20 or life span of nucleotide sequences. 

The present invention also encompasses the use of nucleotide sequences that are 
complementary to the sequences discussed herein, or any derivative, fi-agment or 
derivative thereof. If the sequence is complementary to a fi:Hgment thereof then that 
25 sequence can be used as a probe to identify similar coding sequences in other 
organisms etc. 

Polynucleotides which are not 100% homologous to the sequences of the present 
invention but faU within the scope of the invention can be obtained in a number of ways. 
30 Other variants of the sequences described herein may be obtained for example by probing 
DNA libraries made from a range of individuals, for example individuals fi-om different 
populations. In addition, other viral/bacterial, or cellular homologues particularly cellular 
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homologues found in mammalian cells (e.g. rat, mouse, bovine and primate cells), may 
be obtained and such homologues and fragments thereof in general will be c^jable of 
selectively hybridising to the sequences shown in the sequence listing herein. Such 
sequences may be obtained by probing cDNA libraries made &om or genomic DNA 
5 libraries from o&er animal species, and probing such hbraries with probes comprising all 
or part of any one of the sequences in the attached sequence hstings under conditions of 
medium to high stringency. Similar considarations apply to obtaining species 
homologues and aUelic variants of the polypeptide or nucleotide sequences of the 
invention. 

10 

Variants and strain/species homologues may also be obtained using degenerate PGR 
which will use primers designed to target sequences within the variants and homologues 
encoding conserved amino acid sequences within the sequences of the present inventioiL 
Conserved sequences can be predicted, for example, by aligning the amino acid 
15 sequences fix>m several variants/homologues. Sequence alignments can be performed 
using computer software known in the art. For example the GCG Wisconsin PUeUp 
program is widely used. 

The primers used in degenerate PGR will contain one or more degenerate positions and 
20 will be used at stringency conditions lower than those used for cloning sequences with 
single sequence primers against known sequences. 

Alternatively, such polynucleotides may be obtained by site directed mutagenesis of 
characterised sequences. This may be useful v\diere for example silent codon sequence 
25 changes are required to optimise codon preferences for a particular host cell in which the 
polynucleotide sequences are being expressed. Other sequence changes may be desired 
in order to introduce restriction polypq)tide recognition sites, or to alter the property or 
fimction of the polypeptides encoded by the polynucleotides. 

30 Polynucleotides (nucleotide sequences) of the invention may be used to produce a primer, 
e.g. a PGR primer, a primer for an alternative amplification reaction, a probe e.g. labeUed 
with a revealing label by conventional means using radioactive or non-radioactive labels. 
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or the polynucleotides may be cloned into vectors. Such primers, probes and other 
firagments will be at least 15, preferably at least 20, for example at least 25, 30 or 40 
nucleotides in length, and are also encompassed by the term polynucleotides of the 
invention as used herein. 

5 

Polynucleotides such as DNA polynucleotides and probes according to the invention may 
be produced recombinantly, synthetically, or by any means available to those of skill in 
the ait They may also be cloned by standard techniques. 

10 hi general, primers will be produced by synthetic means, involving a stepwise 
mantifecture of the desired nucleic acid sequence one nucleotide at a time. Techniques 
for accomplishing this using automated techniques are readily available in the art. 

Longer polynucleotides will generally be produced using recombinant means, for 
15 example using a PGR (polymerase chain reaction) cloning techniques. This will involve 
making a pair of primers (e.g. of about 1 5 to 30 nucleotides) flanking a region of the lipid 
targeting sequence which it is desired to clone, bringing the primers into contact with 
mRNA or cDNA obtained from an animal or human cell, performing a polymerase chain 
reaction under conditions which bring about amphfication of the desired region, isolating 
20 the amplified fragment (e.g. by pvuifying the reaction mixture on an agarose gel) and 
recovering the ampliBed DNA. The primers may be designed to contain suitable 
restriction enzyme recognition sites so that the amplified DNA can be cloned into a 
suitable cloning vector. 

HYBRIDISATION 

25 

The present invention also encompasses sequences that are complementary to the 
sequences of the present invention or sequences that are capable of hybridising either 
to the sequences of the present invention or to sequences that are complementary 
thereto. 


30 
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10 


20 


25 


30 


Tlie tenn "hybridisation" as used herein shall include "the process by which a strand of 
nucleic acid joins with a complementary strand through base pairing" as well as the 
process of amplification as carried out in polymerase chain reaction (PGR) 
technologies. 

The present invention also encompasses the use of nucleotide sequences that are 
capable of hybridising to the sequences that are complementary to the subject 
sequences discussed herein, or any derivative, fiagment or derivative thereof. 

The present invention also encompasses sequences tiiat are complementary to 
sequences that are capable of hybridising to the nucleotide sequences discussed herein. 


Hybridisation conditions are based on the melting temperature (Tm) of the nucleotide 
binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular Cloning 
15 Techniques, Methods in Enzymology, Vol. 1 52, Academic Press, San Diego CA), and 
confer a defined "stringency" as explained below. 

Maximum stringency typically occurs at about Tm-5°C (5°C below the Tm of flie 
probe); high stringency at about 5«'C to 10<»C below Tm; intermediate stringency at 
about lO^C to 20°C below Tm; and low stringency at about 20°C to 25°C below Tm. 
As will be understood by those of skill in the art, a maximum stringency hybridisation 
can be used to identify or detect identical nucleotide sequences while an intermediate 
(or low) stringency hybridisation can be used to identify or detect similar or related 
polynucleotide sequences. 


Preferably, tiie present invention encompasses sequences that are complementary to 
sequences that are capable of hybridising under high stringency conditions or 
intermediate stringency conditions to nucleotide sequences encoding polypeptides 
having the specific properties as defined herein. 

More preferably, the present invention encompasses sequences that are complementary 
to sequences that are capable of hybridising under high stringent conditions (e.g. 65°C 
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and OJxSSC {IxSSC - 0.15 M NaCl, 0.015 M Na-citiate pH 7.0}) to nucleotide 
sequences encoding polypeptides having the specific properties as defined herein. 

The present invention also relates to nucleotide sequences that can hybridise to the 
5 nucleotide sequences discussed herein (including complementary sequences of those 
discussed herein). 

The present invention also relates to nucleotide sequences that are complementary to 
sequences that can hybridise to the nucleotide sequences discussed herein (including 
10 complementary sequences of those discussed herein). 

Also included within the scope of the present invention are polynucleotide sequences 
that are capable of hybridising to the nucleotide sequences discussed herein under 
conditions of intermediate to maximal stringency. 

15 

In a preferred aspect, the present invention covers nucleotide sequences that can 
hybridise to the nucleotide sequences discussed herein, or the complement thereof, 
under stringent conditions (e.g. 50^C and 0.2xSSC). 

20 In a more preferred aspect, the present invention covers nucleotide sequences that can 
hybridise to the nucleotide sequences discussed herein, or the complement thereof, 
under high stringent conditions (e.g. 65°C and O.lxSSC). 

EXPRESSION OF POLYPEPTIDES 

25 

A nucleotide sequence for use in the present invention or for encoding a polypeptide 
having the specific properties as defined herein can be incorporated into a recombinant 
replicable vector. The vector may be used to replicate and express the nucleotide 
sequence, in polypeptide fomi, in and/or Jfrom a compatible host cell. Expression may 
30 be controlled using control sequences which include promoters/enhancers and other 
expression regulation signals. Prokaryotic promoters and promoters functional in 
eukaryotic cells may be U3ed. Tissue speciJBc or stimuli specific promoters may be 
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used. Chimeric promoters may also be used comprising sequence elements from two 
or more different promoters described above. 

The polypeptide produced by a host recombinant cell by expression of the nucleotide 
5 sequence may be secreted or may be contained intracellularly depending on the 
sequence and/or the vector used. The coding sequences can be designed with signal 
sequences which direct secretion of the substance coding sequences through a 
particular prokaryotic or eukaryotic cell membrane. 

10 EXPRESSION VECTOR 

The term "expression vector" means a construct capable of in vivo or in vitro expression. 

Preferably, the expression vector is incorporated in the genome of the organism. The term 
1 5 "incorporated" preferably covers stable incorporation into the genome. 

The nucleotide sequence of the present invention or coding for a polypeptide having 
the specific properties as defined herein may be present in a vector, in which the 
nucleotide sequence is operably linked to regulatory sequences such that the regulatory 
20 sequences are capable of providing the expression of the nucleotide sequence by a 
suitable host organism, i.e. the vector is an expression vector. 

The vectors of the present invention may be transformed into a suitable host cell as 
described below to provide for expression of a polypeptide having the specific 
25 properties as defined herein. 

The choice of vector, e.g. plasmid, cosmid, virus or phage vector, will often depend on 
the host cell into which it is to be introduced. 

30 The vectors may contain one or more selectable marker genes — such as a gene which 
confers antibiotic resistance e.g. ampicillin, kanamycin, chloramphenicol or tetracyclin 
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resistance. Alternatively, the selection may be accomplished by co-transfonnation (as 
described in W091/17243). 

Vectors may be used in vitro, for example for the production of RNA or used to 
transfect or transfomi a host cell. 

Thus, in a fiirther embodiment, the invention provides a method of making nucleotide 
sequences of the present invention or nucleotide sequences encoding polypeptides 
having the specific properties as defined herein by introducing a nucleotide sequence 
into a replicable vector, introducing the vector into a compatible host cell, and growing 
the host cell under conditions which bring about replication of the vector. 

The vector may fiirther comprise a nucleotide sequence enabling the vector to replicate 
in the host cell in question. Examples of such sequences are the origins of replication 
of plasmidspUC19, pACYC177, pUBl 10, pE194, pAMBl and pU702. 

REGULATORY SEQUENCES 

In some applications, a nucleotide sequence for use in the present invention or a 
nucleotide sequence encoding a polypeptide having the specific properties as defined 
herein may be operably linked to a regulatory sequence which is capable of providing 
for the expression of the nucleotide sequence, such as by the chosen host cell. By way 
of example, the present invention covers a vector comprising the nucleotide sequence 
of the present invention operably linked to such a regulatory sequence, i.e. the vector is 
an expression vector. 

The term "operably linked" refers to a juxtaposition wherein the components described 
are in a relationship permitting them to fimction in their intended manner. A 
regulatory sequence "operably linked" to a coding sequence is ligated in such a way 
that expression of the coding sequence is achieved under conditions compatible with 
the control sequences. 
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The tenn "regulatory sequences" includes promoters and enhancers and other 
expression regulation signals. 

The term "promoter" is used in the normal sense of the art, e.g. an KNA polymerase 
5 binding site. 

Enhanced expression of the nucleotide sequence encoding the enzyme having the 
specific properties as defined herein may also be achieved by the selection of 
heterologous regulatory regions, e.g. promoter, secretion leader and terminator 
10 regions. 

Preferably, the nucleotide sequence of the present invention may be operably linked to at 
least a promoter. 

15 Examples of suitable promoters for directing the transcription of the nucleotide 
sequence in a bacterial, fimgal or yeast host are well known in the art. 

CONSTRUCTS 

The term "construct" - vMch is synonymous with terms such as "conjugate", "cassette" 
20 and "hybrid" - includes a nucleotide sequence encoding a polypeptide having the specific 
properties as defined herein for use according to the present invention directiy or 
indirectly attached to a promoter. An example of an indirect attachment is the provision 
of a suitable spacer groiq? such as an intron sequence, such as the Shl-intron or the ADH 
intron, intermediate the promoter and the nucleotide sequence of the present invention. 
25 The same is true for the tenn "fiised" in relation to the present invention which includes 
direct or indirect attachment In some cases, the terms do not cover the natural 
combination of the nucleotide sequence coding for the protein ordinarily associated with 
the wild type gene promoter and when they are both in their natural environment 

30 The construct may even contain or express a marker which allows for the selection of the 
genetic construct 


r 
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For some applications, preferably the construct comprises at least a nucleotide 
sequence of the present invention or a nucleotide sequence encoding a polypeptide 
having the specific properties as defined herein operably linked to a promoter. 

5 

HOST CELLS 

The term "host cell" - in relation to the present invention includes asiy cell that 
10 comprises either a nucleotide sequence encoding a polypeptide having the specific 
properties as defined herein or an expression vector as described above and which is 
used in the recombinant production of a polypeptide having the specific properties as 
defined herein. 

15 Thus, a fiirther embodiment of the present invention provides host cells transformed or 
transfected with a nucleotide sequence of the present invention or a nucleotide 
sequence that expresses a polypeptide having the specific properties as defined herein. 
The cells will be chosen to be compatible with the said vector and may for example be 
prokaryotic (for example bacterial), fimgal, yeast or plant cells. Preferably, the host 

20 cells are not himian cells. 

Examples of suitable bacterial host organisms are gram negative bacteritmi or gram 
positive bacteria, 

25 Depending on the nature of the nucleotide sequence encoding a polypeptide having the 
specific properties as defined herein, and/or the desirability for fiirther processing of 
the expressed protein, eukaryotic hosts such as yeasts or other fimgi may be preferred. 
In general, yeast cells arie preferred over iungal cells because they axe easier to 
manipulate. However, some proteins are either poorly secreted from the yeast cell, or 

30 in some cases are not processed properly (e.g. hyperglycosylation in yeast). In these 
instances, a different fimgal host organism should be selected. 
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The use of suitable host cells, such as yeast, fungal and plant host cells - may provide 
for post-translational modifications (e.g. myristoylation, glycosylation, truncation, 
lapidation and tyrosine, serine or threonine phosphorylation) as may be needed to 
confer optimal biological activity on recombinant expression products of the present 
5 invention. 

The host cell may be a protease deficient or protease minus strain. 
ORGANISM 

10 

The terai "organism" in relation to the present invention includes any organism that 
could comprise a nucleotide sequence according to the present invention or a 
nucleotide sequence encoding for a polypeptide having the specific properties as 
defined herein and/or products obtained therefrom. 

15 

Suitable organisms may include aprokaiyote, firngus, yeast or a plant. 

The temi "transgenic organism" in relation to the present invention includes any 
organism that comprises a nucleotide sequence coding for a polypeptide having the 
20 specific properties as defined herein and/or the products obtained therefirom, and/or 
wherein a promoter can allov^ expression of the nucleotide sequence coding for a 
polypeptide having the specific properties as defined herein within the organism. 
Preferably the nucleotide sequence is incorporated in the genome of the organism. 

25 The term ''transgenic organism" does not cover native nucleotide coding sequences in 
their natural environment when they are under the control of their native promoter 
which is also in its natural environment. 

Therefore, the transgenic organism of the present invention includes an organism 
30 comprising any one of, or combinations of, a nucleotide sequence coding for a 
polypeptide having the specific properties as defined herein, constructs as defined 
herein, vectors as defined herein, plasmids as defined herein, cells as defined herein, or 
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the products thereof. For example the transgenic organism can also comprise a 
nucleotide sequence coding for a polypeptide having the specific properties as defined 
herein under the control of a heterologous promoter. 


TRANSFORMATION OF HOST CELLS/ORGANISM 

As indicated earlier, the host organism can be a prokaryotic or a eukaryotic organism. 
Examples of suitable prokaryotic hosts include E. coli and Bacillus subtilis. 

Teachings on the transformation of prokaryotic hosts is well documented in the art, for 
example see Sambrook et al (Molecular Cloning: A Laboratory Manual, 2nd edition, 
1989, Cold Spring Harbor Laboratoiy Press). If a prokaryotic host is used then the 
nucleotide sequence may need to be suitably modified before transformation - such as 
by removal of introns. 

In another embodiment the transgenic organism can be a yeast. 

Filamentous fungi cells may be transfomied using various methods known in the art - 
such as a process involving protoplast formation and transformation of the protoplasts 
followed by regeneration of the cell wall in a manner known. The use of Aspergillzis 
as a host microorganism is described in EP 0 238 023. 

Another host organism can be a plant. A review of the general techniques used for 
transforming plants may be found in articles by Potrykus {Annu Rev Plant Physiol 
Plant Mol Biol [1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech 
March/April 1994 17-27). Further teachings on plant transforation may be found in 
EP-A-0449375. 

General teachings on the transformation of fungi, yeasts and plants are presented in 
following sections. 
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TRANSFORMED BACTERIA 

A host organism may be a bacterium, such as Streptomyces, Bacillus subtillis or E,coli. 
5 Suitable methods of heterologous expression in Kcoli are disclosed in WO04/064537. 
Suitable methods of heterologous expression in Bacillus are disclosed in 
WO02/214490. Examples of suitable bacterial host organisms are gram positive 
bacterial species such ay Bacillaceae, including Bacillus subtilis. Bacillus _ 
licheniformis, Bacillus lentus, Bacillus brevis, Bacillus stearothermophilus. Bacillus 

10 alkalophilus. Bacillus amyloliquefaciens. Bacillus coagulans. Bacillus lautus. Bacillus 
megaterium and Bacillus thuringiensis, Streptomyces species, such as Streptomyces 
murinus, lactic acid bacterial species, including Lactococcus spp., such as Lactococcus 
lactis, Lactobacillus spp., including Lactobacillus reuteri,, Leuconostoc spp., 
Pediococcus spp. and Streptococcus spp. Alternatively, strains of a gram-negative 

15 bacterial species belonging to Enterobacteriaceae, including E. coli, or to 
Pseudomonadaceae can be selected as the host organism. 

TRANSFORMED FUNGUS 

20 A host organism may be a fungus - such as a filamentous fungus. Examples of suitable 
such hosts include any member belonging to the genera Thermomyces, Acremonium, 
Aspergillus, Penicillium, Mucor, Neurospora, Trichoderma and the like. 

Teachings on transforming filamentous fungi are reviewed in US-A-5741665 which 
25 states that standard techniques for transformation of filamentous fimgi and culturing 
the fungi are well known in the art. An extensive review of techniques as applied to K 
crassa is found, for example in Davis and de Serres, Methods Enzymol (1971) 17A: 
79-143. 

30 Further teachings on transforming filamentous fimgi are reviewed in US-A-5674707. 
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In one aspect, the host organism can be of the genus Aspergillus, such as Aspergillus 
niger. 

5 A transgenic Aspergillus according to the present invention can also be prepared by 
following, for example, the teachings of Turner G. 1994 (Vectors for genetic 
manipulation. In: Martinelli S.D., Kinghom J.R.( Editors) Aspergillus: 50 years on. 
Progress in industrial microbiology vol 29. Elsevier Amsterdam 1994. pp. 641-666), 

10 Gene expression in filamentous fungi has been reviewed in Punt et aL (2002) Trends 
Biotechnol 2002 May;20(5):200-6, Archer & Peberdy Crit Rev Biotechnol (1997) 
17(4):273-306. 

TRANSFORMED YEAST 

15 

In another embodiment, the transgenic organism can be a yeast. 

A review of the principles of heterologous gene expression in yeast are provided in, for 
example. Methods Mol Biol (1995), 49:341-54, and Curr Opin Biotechnol (1997) 
20 Oct;8(5):554-60 

In this regard, yeast — such as the species Saccharomyces cerevisi or Pichia pastoris (see 
FEMS Microbiol Rev (2000 24(l):45-66), may be used as a vehicle for heterologous 
gene expression. 

25 

A review of the principles of heterologous gene expression in Saccharomyces cerevisiae 
and secretion of gene products is given by E Hinchcliffe E Kenny (1993, "Yeast as a 
vehicle for the expression of hetoiologous genes". Yeasts, Vol 5, Anthony H Rose and 
J Stuart Harrison, eds, 2nd edition. Academic Press Ltd.). 

30 

For the transformation of yeast, several transformation protocols have been develoF>ei 
For exan[q)le, a transgaiic Saccharomyces according to the present invention can be 
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prepared by following the teachings of Hinnen et aLy (1978, Proceedings of the National 
Academy of Sciences of the USA 75, 1929); Beggs, J D (1978, Natia-e, London, 275, 
104); and Ito, H et al (1983, J Bacteriology 153, 163-168). 

5 The transformed yeast cells may be selected using various selective markers — such as 
auxotrophic markers dominant antibiotic resistance markers. 

A suitable yeast host organism can be selected from the biotechnologically relevant 
yeasts species such as, but not limited to, yeast species selected from Pichia spp., 
10 Hansenula spp., Kluyveromyces, Yarrowinia spp., Saccharomyces spp., including S. 
cerevisiae, or Schizosaccharomyce spp. including Schizosaccharomyce pombe. 

A strain of the methylotrophic yeast species Pichia pastoris may be used as the host 
organism. 

15 

In one embodiment, the host organism may be a Hansenula species, such as K 
polymorpha (as described in WOO 1/39544). 

TRANSFORMED PLANTS/PLANT CELLS 

20 

A host organism suitable for the present invention may be a plant. A review of the 
general techniques may be found in articles by Potrykus {Anmi Rev Plant Physiol Plant 
Mol Biol [1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech Marx:h/April 
1994 17-27), or in WOOl/16308. The transgenic plant may produce enhanced levels of 
25 phytosterol esters and phytostanol esters, for example. 

Therefore the present invention also relates to a method for the production of a transgenic 
plant with enhanced levels of phytosterol esters and phytostanol esters, comprising the 
steps of transforming a plant cell with a hpid acyltransferase as defined herein (in 
30 particular with an expression vector or construct comprising a lipid acyltransferase as 
defined herein), and growing a plant from the transformed plant cell. 
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SECRETION 

Often, it is desirable for the polypeptide to be secreted from the expression host into 
the culture medium from where the enzyme may be more easily recovered. According 
5 to the present invention, the secretion leader sequence may be selected on the basis of 
the desired expression host Hybrid signal sequences may also be used with the 
context of the present invention. 

Typical examples of heterologous secretion leader sequences are those originating 
10 from the fimgal amyloglucosidase (AG) gene {glaA - both 18 and 24 amino acid 
versions e.g. from Aspergillus\ the a-factor gene (yeasts e.g. Saccharomyces, 
Kluyveromyces and Hansenula) or the a-amylase gene (Bacillus), 

DETECTION 

15 

A variety of protocols for detecting and measuring the expression of the amino acid 
sequence are known in the art. Examples include enzyme-linked inununosorbent 
assay (ELISA), radioimmimoassay (RIA) and fluorescent activated cell sorting 
(FACS). 

20 

A wide variety of labels and conjugation techniques are known by those skilled in the 
art and can be used in various nucleic and amino acid assays. 

A number of companies such as Pharmacia Biotech (Piscataway, NJ), Promega 
25 (Madison, WI), and US Biochemical Corp (Cleveland, OH) supply commercial kits 
and protocols for these procedures. 

Suitable reporter molecules or labels include those radionuclides, enemies, 
fluorescent, chemiluminescent, or chromogenic agents as well as substrates, cofactors, 
30 inhibitors, magnetic particles and the like. Pataits teaching the use of such labels 
mclude US-A-3,8 17,837; US-A-3,850,752; US-A-3,939,350; US-A-3,996,345; US-A- 
4,277,437; US-A^,275,149 and US-A-4,366,241. 
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Also, recombinant unmunoglobulins may be produced as shown in US-A-4,8 16,567. 

5 FUSION PROTEINS 

A polypeptide having the specific properties as defined herein may be produced as a 
fusion protein, for example to aid in extraction and purification thereof. Examples of 
fiision protein partners include glutathione-S-transferase (GST), 6xHis, GAL4 (DNA 
10 binding and/or transcriptional activation domains) and P-galactosidase. It may also be 
convenient to include a proteolytic cleavage site between the fusion protein partner 
and the protein sequence of interest to allow removal of fusion protein sequences. 
Preferably the fusion protein will not hinder the activity of the protein sequence. 

15 Gene fusion expression systems in E. coli have been reviewed in Curr. Opin. 
Biotechnol. (1995) 6(5):501-6. 

In another embodiment of the invention, the amino acid sequence of a polypeptide 
having the specific properties as defined herein may be ligated to a heterologous 
20 sequence to encode a fusion protein. For example, for screening of peptide Ubraries for 
agents capable of affecting the substance activity, it may be useful to encode a 
chimeric substance expressing a heterologous epitope that is recognised by a 
commercially available antibody. 

25 The invention will now be described, by way of example only, with reference to the 
following Figures and Examples. 

Figure 1 shows a pfam00657 consensus sequence from database version 6 (SEQ ID 
No. 1); 


30 
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Figure 2 shows an amino acid sequence (SEQ ID No. 2) obtained from the organism 
Aeromonas hydrophila (PI 0480; GI: 121 051). This amino acid sequence is a reference 
en2yme, which may be a parent enzyme in accordance with the present invention; 

5 Figure 3 shows an amino acid sequence (SEQ ID No. 3) obtained from the organism 
Aeromonas salmonicida (AAG098404; 01:9964017); 

Figure 4 shows an amino acid sequence (SEQ ID No. 4) obtained from the organism 
Streptomyces coelicolor A3(2) (Genbank accession ninnber NP_63 1 558); 

10 

Figure 5 shows an amino acid sequence (SEQ ID No. 5) obtained from the organism 
Streptomyces coelicolor A3(2) (Genbank accession number CAC42140); 

Figure 6 shows an amino acid sequence (SEQ ID No. 6) obtained from the organism 
1 5 Saccharomyces cerevisiae (Genbank accession number P41 734); 

Figure 7 shows an aUgnment of selected sequences to pfam00657 consensus sequence; 

Figure 8 shows a pairwise aUgnment of SEQ ID No. 3 with SEQ ID No. 2 showing 
20 93% amino acid sequence identity. The signal sequence is underlined. + denotes 
differences. The GDSX motif containing the active site serine 16, and the active sites 
aspartic acid 116 and histidine 291 are highlighted (see shaded regions). Numbers after 
the amino acid is minus the signal sequence; 

25 Figure 9 shows a nucleotide sequence (SEQ ID No. 7) encoding a lipid acyl transferase 
according to the present invention obtained from the organism Aeromonas hydrophila; 


30 


Figure 10 shows a nucleotide sequence (SEQ ID No. 8) encoding a lipid acyl 
transferase accordiog to the present invention obtained from the organism Aeromonas 
salmonicida; 
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Figure 11 shows a nucleotide sequence (SEQ ED No. 9) encoding a lipid acyl 
transferase according to the present invention obtained from the organism 
Streptomyces coelicolor A3(2) (Genbank accession number 
NC_003888.1:8327480..8328367); 

5 

Figure 12 shows a nucleotide sequence (SEQ ID No. 10) encoding a lipid acyl 
transferase according to the present invention obtained from the organism 
Streptomyces coelicolor A3(2) (Genbank accession number 
AL93913 LI :265480„266367); 

10 

Figure 13 shows a nucleotide sequence (SEQ ID No. 11) encoding a lipid acyl 
transferase according to the present invention obtained from the organism 
Saccharomyces cerevisiae (Genbank accession number Z75034); 

15 Figure 14 shows an amino acid sequence (SEQ ID No. 12) obtained from the organism 
Ralstonia (Genbank accession number: AL646052); 

Figure 15 shows a nucleotide sequence (SEQ ID No. 13) encoding a lipid acyl 
transferase according to the present invention obtained from the organism Ralstonia; 

20 

Figure 16 shows SEQ ID No. 14. Scoel NCBI protein accession code CAB39707.1 
GI:4539178 conserved hypothetical protein [Streptomyces coelicolor A3(2)]; 

Figure 17 shows a nucleotide sequence shown as SEQ ID No. 15 encoding NCBI 
25 protein accession code CAB39707.1 GI:4539178 conserved hypothetical protein 
[Streptomyces coelicolor A3(2)]; 

Figure 18 shows an amino acid shown as SEQ ID No. 16. Scoe2 NCBI protein 
accession code CAC01477.1 GI:9716139 conserved hypothetical protein 
30 [Streptomyces coelicolor A3(2)]; 
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Figure 19 shows a nucleotide sequence shown as SEQ ID No. 17 encoding Scoe2 
NCBI protein accession code CAC01477.1 GI:9716139 conserved hypothetical 
protein [Streptomyces coelicolor A3(2)]; 

5 Figure 20 shows an amino acid sequence (SEQ ID No. 18) Scoe3 NCBI protein 
accession code CAB88833.1 01:7635996 putative secreted protein. [Streptomyces 
coelicolor A3(2)]; 

Figure 21 shows a nucleotide sequence shown as SEQ ID No. 19 encoding Scoe3 
10 NCBI protem accession code CAB88833.1 01:7635996 putative secreted protein. 
[Streptomyces coelicolor A3 (2)] ; 

Figure 22 shows an amino acid sequence (SEQ ID No. 20) Scoe4 NCBI protein 
accession code CAB89450.1 01:7672261 putative secreted protein. [Streptomyces 
15 coelicolor A3(2)]; 

Figure 23 shows an nucleotide sequence shown as SEQ ID No. 21 encoding Scoe4 
NCBI protein accession code CAB89450.1 01:7672261 putative secreted protein. 
[Streptomyces coelicolor A3(2)]; 

20 

Figure 24 shows an amino acid sequence (SEQ ID No. 22) Scoe5 NCBI protein 
accession code CAB62724.1 01:6562793 putative Upoprotein [Streptomyces 
coelicolor A3(2)]; 

25 Figure 25 shows a nucleotide sequence shown as SEQ ID No. 23, encoding Scoe5 
NCBI protein accession code CAB62724.1 01:6562793 putative lipoprotein 
[Streptomyces coelicolor A3(2)]; 

Figure 26 shows an amino acid sequence (SEQ ID No. 24) Sriml NCBI protein 
30 accession code AAK84028. 1 OI: 1 5082088 ODSL-lipase [Streptomyces rimosus]; 
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Figure 27 shows a nucleotide sequence shown as SEQ ID No. 25 encoding Sriml 
NCBI protein accession code AAK84028.1 GI:15082088 GDSL-lipase [Streptomyces 
rimosus]; 

5 Figure 28 shows an amino acid sequence (SEQ ID No. 26) - a Upid acyl transferase 
from Aeromonas hydrophila (ATCC #7965); 

Figure 29 shows a nucleotide sequence (SEQ ID No. 27) encoding a lipid 
acyltransferase from Aeromonas hydrophila (ATCC #7965); 

10 

Figure 30 shows an amino acid sequence (SEQ ID No. 28) of a lipid acyltransferase 
from Aeromonas salmonicida subsp. Salmonicida (ATCC#14174); 

Figure 31 shows a nucleotide sequence (SEQ ID No. 29) encoding a lipid 
15 acyltransferase from Aeromonas salmonicida subsp. Salmonicida (ATCC#14174); 

Figure 32 shows that homologues of the Aeromonas genes can be identified using the 
basic local alignment search tool service at the National Center for Biotechnology 
Information, NIH, MD, USA and the completed genome databases. The GDSX motif 

20 was used in the database search and a number of sequences/genes potentially encoding 
enzymes with lipolytic activity were identified. Genes were identified from the genus 
Streptomyces, Xanthomonas and Ralstonia. As an example below, the Ralstonia 
solanacearum was aligned to the Aeromonas salmonicida (satA) gene. Pairwise 
aligimient showed 23% identity. The active site serine is present at the amino terminus 

25 and the catalytic residues histidine and aspartic acid can be identified; 

Figure 33 shows the Pfam00657.11 [family 00657, database version 11] consensus 
sequence (hereafter called Pfam consensus) and the alignment of various sequences to 
the Pfam consensus sequence. The arrov^ indicate the active site residues, the 
30 underlined boxes indicate three of the homology boxes indicated by [Upton C and 
Buckley JT (1995) Trends Biochem Sci 20; 179-179]. Capital letters in the Pfam 
consensus indicate conserved residues in many family members. The — symbol 
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indicates a position where the hidden Markov model of the Pfam consensus expected 
to find a residue but did not, so a gap is inserted. The . symbol indicates a residue 
without a corresponding residue in the Pfam consensus. The sequences are the amino 
acid sequences listed in Figures 16, 18, 20, 22, 24, 26, 28 and 30. 

5 

Figure 34 shows the Pfam00657.11 [family 00657, database version 11] consensus 
sequence (hereafter called Pfam consensus) and the alignment of various sequences to 
the Pfam consensus sequence. The arrows indicate the active site residues, the 
underlined boxes indicate three of the homology boxes indicated by [Upton C and 

10 Buckley JT (1995) Trends Biochem Sci 20; 179-179]. Capital letters in the Pfam 
consensus indicate conserved residues in many family members. The - symbol 
indicates a position where the hidden Markov model of the Pfam consensus expected 
to find a residue but did not, so a gap is inserted. The . symbol indicates a residue 
without a corresponding residue in the Pfam consensus. The sequences are the amino 

15 acid sequences listed in Figures 2, 16, 18, 20, 26, 28 and 30. All these proteins were 
found to be active against lipid substrates. 

Figure 35 shows an anaino acid sequence (SEQ ID No. 30) of the fiision construct used 
for mutagenesis of the Aeromonas hydrophila lipid acyltransferase gene in Example 7. 
20 The underlined amino acids is a xylanase signal peptide; 

Figure 36 shows a nucleotide sequence (SEQ ID No. 31) encoding a lipid 
acyltransferase enzyme fi-om Aeromonas hydrophila including a xylanase signal 
peptide; 

25 

Figure 37 shows a nucleotide sequence encoding a lipid acyltransferase enzyme fi-om 
Streptomyces (SEQ ID No. 32); 

Figure 38 shows a polypeptide sequence of a lipid acyltransferase enzyme firom 
30 Streptomyces (SEQ ID No. 33); 
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Figure 39 shows a polypeptide sequence of a lipid acyltransferase enzyme J&om 
TermobifidaXSEQ ID No. 34); 

Figure 40 shows a nucleotide sequence encoding a lipid acyltransferase enzyme from 
5 TermobifidaXSEQ ID No. 35); 

Figure 41 shows a polypeptide sequence of a lipid acyltransferase enzyme from 
TermobifidaXSEQ ID No. 36); 

10 Figure 42 shows a polypeptide of a lipid acyltransferase enzyme from 
Corynebacterium\effciens\ GDSx 300 aa.(SEQ ID No. 37); 

Figure 43 shows a nucleotide sequence encoding a Upid acyltransferase enzyme from 
Corynebacterium\effciens\ GDSx 300 aa_(SEQ ID No. 38); 

15 

Figure 44 shows a polypeptide of a lipid acyltransferase enzyme from 
Novosphingobium\aromaticivorans\ GDSx 284 aa_(SEQ ID No. 39); 

Figure 45 shows a nucleotide sequence encoding a lipid acyltransferase enzyme from 
20 Novosphingobium\aromaticivorans\ GDSx 284 aa (SEQ ID No. 40); 

Figure 46 shows a polypeptide of a lipid acyltransferase enzyme from Streptomyces 
coelicolor\ GDSx 268 aa (SEQ ID No. 41); 

25 Figure 47 shows a nucleotide sequence encoding a lipid acyltransferase enzyme from 
Streptomyces coelicolorX GDSx 268 aa (SEQ ID No. 42); 

Figure 48 shows a polypeptide of a lipid acyltransferase enzyme from Streptomyces 
avermitilis \ GDSx 269 aa (SEQ ID No. 43); 

30 

Figure 49 shows a nucleotide sequence encoding a lipid acyltransferase enzyme from 
Streptomyces avermitilis \ GDSx 269 aa (SEQ ID No. 44); 
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Figure 50 shows a polypeptide of a lipid acyltransferase enzyme from Streptomyces 
(SEQIDNo.45); 

Figure 51 shows a nucleotide sequence encoding a lipid acyltransferase enzyme from 
5 Streptomyces (SEQ ID No. 46); 

Figure 52 shows a ribbon representation of the UVN.PDB crystal structure which has 
glycerol in the active site. The Figure was made using the Deep View Swiss-PDB 
viewer; 

10 

Figure 53 shows 1IVN.PDB Crystal Structure - Side View using Deep View Swiss- 
PDB viewer, with glycerol in active site - residues within 1 oA of active site glycerol 
are coloured black; 

15 Figure 54 shows IIVN.PDB Crystal Structure - Top View using Deep View Swiss- 
PDB viewer, with glycerol in active site - residues within lOA of active site glycerol 
are coloured black; 

Figure 55 shows alignment 1; 

20 

Figure 56 shows alignment 2; 

Figures 57 and 58 show an alignment of IIVN to P10480 (P10480 is the database 
sequence for A, hydrophila enzyme), this alignment was obtained from the PFAM 
25 database and used in the model building process; 

Figure 59 shows an alignment where PI 0480 is the database sequence for Aeromonas 
hydrophila. This sequence is used for the model construction and the site selection. 
Note that the full protein is depicted, the mature protein (equivalent to SEQ ID No. 2) 
30 starts at residue 19. A. sal is Aeromonas salmonicida (SEQ ID No. 28) GDSX lipase, 
A. hyd is Aeromonas hydrophila (SEQ ID No. 26) GDSX lipase. The consensus 
sequence contains a * at the position of a difference between the listed sequences; 
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Figure 60 shows a typical set of 384 clones, the wild type control lies at the 
intersection of 0.9PC, 0.8DGDG; and 

5 Figure 61 shows three areas of interest. Section 1 contains mutants with an increased 
ratio R but lower activity towards DGDG. Region 2 contains mutants with an 
increased ratio R and an increased DGDG activity. Region 3 contains clones with an 
increased PC or DGDG activity, but no increase in the ratio R. 

10 

EXAMPLE 1 

Modelling of Aeromonas hydrophila GDSx lipase on IIVN 

The alignment of the Aeromonas hydrophila GDSX lipase amino acid sequence 
15 (PI 0480) to the Escherichia coli Tioesterase amino acid sequence (IIVN) and the 
Aspergillus aculeatus rhamnogalacturonan acetylesterase amino acid sequence 
(IDEO) was obtained from the PFAM database in FASTA format. The alignment of 
PI 0480 and IIVN was fed into an automated 3D structure modeller (SWISS- 
MODELLER server at www.expasv.org) together with the 1IVN.PDB crystal structure 
20 coordinates file FIGURE 52). The obtained model for PI 0480 was structurally aUgned 
to the crystal structures coordinates of IFVN.PDB and 1DE0.PDB using the 'Deep 
View Swiss-PDB viewer* (obtained from www.expasv.org/spdbv/) (FIGURE 53). The 
an^no acid alignment obtained from the PFAM database (alignment 1 - (FIGURE 55)) 
was modified based on the structural alignment of 1DEO.PDB and HVN.PDB. This 
25 alternative amino acid alignment is called alignment 2 (FIGURE 56). 

The 1IVN.PDB structure contains a glycerol molecule. This molecule is considered to 
be in the active site because it is in the vicinity of the catalytic residues. Therefore, a 
selection can be made of residues that are close to the active site which, due to their 
30 vicinity, are likely to have an influence on substmte binding, product release, and/or 
catalysis. In the IIVN.PDB structure, all amino acids within a 10 A sphere centered on 
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the central carbon atom of the glycerol molecule in the active site were selected 
(amino acid set 1) (See Figure 53 and Figure 54). 

The following amino acids were selected from the P10480 sequence; (1) all amino 
5 acids in PI 0480 correspondmg to the amino acid set 1 in alignment 1; (2) all amino 
acids in PI 0480 corresponding to the amino acid set 1 in alignment 2; (3) from the 
overlay of the PI 0480 model and IIVN all amino acids in the PI 0480 model within 
12A from the glycerol molecule in IIVN, All three groups combined give amino acid 
set 2. ' 

10 

Sequence PI 0480 was aligned to "AAG09804.1 GI:9964017 glycerophospholipid- 
cholesterol acyltransferase [Aeromonas salmonicida]" and the residues in AAG09804 
corresponding to amino acid set 2 were selected to give amino acid set 3. 

15 Set 1,2, and 3 
Amino acid set 1 : 

Amino acid set 1 (note that these are ammo acids in IIVN - Figure 57 and Figure 58.) 
20 Gly8, Asp 9 , Leull, Serl2, TyrlS, Gly44, Asp45, Thr46, Glu69, Leu70. Glv7K Gly72, 
Asn73, Asp74. Gly75, Leu76, Ghil06, nel07, ArglOS, Leul09, ProllO, Tyrll3, 
Phel21, Phel39, PheHO, Metl41, Tyrl45, Metl51, Aspl54. Hisl57. Glyl55, Ilel56, 
Prol58 

25 The highly conserved motifs, such as GDSx and catalytic residues, were deselected 
from set 1 (residues underlined). For the avoidance of doubt, set 1 defines the amino 
acid residues within lOA of the central carbon atom of a glycerol in the active site of 
the IIVN model. 

30 Amino acid set 2: 
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Amino acid set 2 (note that tiie numbering of the amino acids refers to the amino acids 
in the P 1 0480 mature sequence) 

Leul7, Lys22, Met23, Gly40, AsnSO, ProSl, Lys82, Asn87, Asn88, Tiplll, Vail 12, 
Alall4, Tyrll7, LeullS, Prol56, Glyl59, Glnl60, Asnl61, Prol62, Serl63, Alal64, 
5 Argl65, Serl66, Glnl67, Lysl68, Vall69, Vall70, Glul71, Alal72, Tyrl79, Hisl80, 
Asnl81, Met209, Leu210, Arg21 1, Asn215, Lys284, Met285, Gln289 and Val290. 


Table of selected residues in Set 1 compared with Set 2: 

10 



rVN model 

P10480 



Mature sequence residue 

IVN 

A.hyd homologue 

Number 


PFAM 

Structure 


Gly8 

Gly32 



Asp9 

Asp33 



SerlO 

Ser34 



Leull 

Leu35 


Leul7 

Serl2 

Ser36 


Serl8 




Lys22 




Met23 

Tyrl5 

Gly58 


Gly40 

Gly44 

Asn98 


AsnSO 

Asp45 

Pro99 


Pro81 

Thr46 

LyslOO 


Lys82 




Asn87 




Asn88 

Glu69 

Trpl29 


Tiplll 

Leu70 

VaI130 


Vail 12 

Gly71 

Glyl31 



Gly72 

Alal32 


Alall4 

Asn73 

Asnl33 




wo 2005/066347 


PCT/IB2004/004378 


no 


Asp74 

Aspl34 



Gly75 

Tyrl35 


Tyrll? 

Leu76 

Leul36 


LeullS 

Glnl06 


Prol74 

Piol56 

nel07 


Glyl77 

Glyl59 

Argl08 


Gliil78 

Glnl60 

Leul09 


Asnl79 

Asnl61 

ProllO 


180 to 190 

Prol62 

Tyrll3 



Serl63 




Alal64 




Argl65 




Serl66 




Glnl67 




Lysl68 




Vall69 




Vall70 




GIul71 




Alal72 

Phel21 

Hisl98 

Tyrl97 

Tyrl79 



Hisl98 

HislSO 



Asnl99 

Asnl81 

Phel39 

Met227 


Met209 

PheMO 

Leu228 


Leu210 

Metl41 

Arg229 


Arg211 

Tyrl45 

Asii233 


Asn215 




Lys284 

MetlSl 

Met303 


Met285 

Aspl54 

Asp306 



Glyl55 

Gln307 


GIii289 

nel56 

Val308 


Val290 

Hisl57 

His309 
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Prol58 

Pro310 




Amino acid set 3: 

5 Amino acid set 3 is identical to set 2 but refers to the Aeromonas salmonicida (SEQ ID 
No. 28) coding sequence, i.e. the amino acid residue numbers are 18 higher in set 3 as 
this reflects the difference between the andno acid numbering in the mature protein 
(SEQ ID No. 2) compared with the protein including a signal sequence (SEQ ID No. 
28). 

10 

The mature proteins of Aeromonas salmonicida GDSX (SEQ ID No. 28) and 
Aeromonas hydrophila GDSX (SEQ ID No. 26) differ in jfive amino acids. These are 
Thr3Ser, Glnl82Lys, Glu309Ala, Ser310Asn, Gly318-, where the salmonicida residue 
is listed first and the hydrophila residue is listed last (FIGURE 59). The hydrophila 
15 protein is only 317 amino acids long and lacks a residue in position 318. The 
Aeromonas salmonicidae GDSX has considerably high activity on polar hpids such as 
galactolipid substrates than the Aeromonas hydrophila protein. Site scanning was 
performed on all five amino acid positions. 

20 Amino acid set 4: 

Amino acid set 4 is S3, Q182, E309, S3 10, and -318. 

25 Amino acid set 5: 

F13S, D15N, S18G, S18V, Y30F, Dl 16N, Dl 16E, D157 N, Y226F, D228N Y230F. 


30 
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Amino acid set 6: 

Amino acid set 6 is Ser3, Leul7, Lys22, Met23, Gly40, AsnSO, Pro81, Lys82, Asn 87, 
Asn88, Tiplll,Valll2,Alall4,Tyrll7,Leull8, Prol56, Glyl59, Glnl60, Asnl61, 
5 Prol62, Serl63, Alal64, Argl65, Serl66, Glnl67, Lysl68, Vall69, VaI170, Glul71, 
Alal72, Tyrl79, HislSO, Asnl81, Glnl82, Met209, Leu210, Arg211, Asn215, 
Lys284, Met285, Gin289, VaI290, Glu309, SerSlO, -318. 

The numbering of the amino acids in set 6 refers to the amino acids residues in PI 0480 
10 (SEQ ID No. 2) - corresponding amino acids in other sequence backbones can be 
determined by homology alignment and/or structural alignment to PI 0480 and/or 

irvN. 

Amino acid set 7: 

15 

Amino acid set 7 is Ser3, Leul7, Lys22, Met23, Gly40, Asn80, ProSl, Lys82, Asn 87, 
Asn88, Trpl 11, Vail 12, Alal 14, Tyrl 17, Leul 18, Prol56, Glyl59, Glnl60, Asnl61, 
Prol62, Serl63, Alal64, Argl65, Serl66, Ghil67, Lysl68, Vall69, Vall70, Glul71, 
Alal72, Tyrl79, HislSO, AsnlSl, Glnl82, Met209, Leu210, Arg211, Asn215, 

20 Lys284, Met285, Ghi289, Val290, Glu309, Ser310, -318, Y30X (where X is selected 
from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), Y226X (where X is 
selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), Y230X (where 
X is selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), S18X 
(where X is selected from A, C, D, E, F, H, I, K, L, M, N, P, Q, R, T, W or Y), D157X 

25 (where X is selected from A, C, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y). 

The numbering of the amino acids in set 7 refers to the amino acids residues in P10480 
(SEQ ID No. 2) — corresponding amino acids in other sequence backbones can be 
determined by homology alignment and/or structural ahgnment to PI 0480 and/or 
30 IIVN). 
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From the crystal structure one can obtain the secondary structure classification. That 
means, one can classify each amino acid as being part of an alpha-helix or a beta-sheet. 
Figure 57 shows the PFAM alignment of IDEO, IIVN, and P10480 (the database 
Aeromonas hydrophila). Added below each line of sequence is the structural 
5 classification. 

The PFAM database contains alignments of protems with low sequence identity. 
Therefore, these alignments are not very good. Although the alignment algorithms 
(HAMMER profiles) are well suited for recognizing conserved motifs the algorithm is 
10 not very good on a detailed level. Therefore it is not surprising to find a disparity 
between the PFAM alignment and a structural alignment As a skilled person would be 
readily aware, one can modify the PFAM alignment based on the structural data. 
Meaning that one can align those structural elements that overlap. 

15 FIGURE 55 shows the original PFAM ahgnment of IDEO, IIVN and PI 0480. Added 
to the alignment is the secondary structure information from the crystal stmctures of 
IDEO and IIVN. Alignment 2 in FIGURE 56 shows a manually modified alignment 
where the match between the secondary structure elements is improved. Based on 
conserved residues between either IDEO and PI 0480 or between IIVN and PI 0480 

20 the alignment was modified for PI 0480 as well. To easily distinguish the sequence 
blocks the sequence identifiers in ahgnment 2 have an extm m (IDEOm, lIVNm, 
P10480m). 

Alignment 3 is a mix of 1 and 2, it gives the ahgnment per block. 

25 

EXAMPLE 2: Construction of site scan hbraries 

The Quick Change Multi Site-Directed Mxitagenesis Kit from Stratagene was used 
according to the manufacturers instruction- For each library a degenerate primer with 
30 one NNK or NNS (nucleotide abbreviations) codon was designed. Primer design was 
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perfonned using the tools available on the Stratagene web site. Primer quality control 
was further confirmed using standard analysis tools which analyze the primer for the 
potential of forming hairpins or of forming primer-dimers. 

5 The main concepts of the method are as foUows; using a non-strand displacing high- 
fidelity DNA polymerase such as Pfu-Turbo and a single primer one will linearly 
amplify the DNA template. This is in contrast to the normal exponential amplification 
process of a PGR reaction. This linear amplification process ensures a low error 
fi^uency. The product is single stranded non-methylated DNA and double stranded 
10 hemi-methylated DNA. Kthe template is obtained fix)m a suitable host organism, then 
the template is double stranded methylated DNA. This means that the template DNA 
can be digested with Dpn I endonuclease without digesting the product DNA. 
Therefore upon transformation of the DNA into a suitable host only a very low 
firequency of the transformants with non-mutagenized plasmid. 

15 

EXAMPLE 3: Selection of winners firom a site scan library 

Two alternative approaches are described; Ubrary sequencing followed by analysis of 
unique amino acids, or library analysis followed by sequencing of the winners. 

20 

Selection of winners method 1; library sequencing followed by analysis of unique 
amino acids. 

The transformation/expression shuttle vector used for generation of the site scanning 
25 libraries/variants in E, coli and expression of the variants in B, subtilis was derived 
firom pDP66S, Penninaga et al, ( Biochemistry (1995), 3368-3376), by replacement of 
the selection cassette to a kanamycin selection cassette. The vector used to insert the 
acyl-transferase variant gene in place of the cgt gene down-stream of the P32 
promoter. The vector uses the P32 promoter to drive expression of the acyl-transferase 
30 variant gene in B, subtilis. 


f 
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The expression vector was transformed into nprE-, aprA- Bacillus stAtillis DB104 
(Kawamura and Doi, J. of Bacteriology Oct 1984, p442-444) using transformation 
methods, as described in Chapter 3, Molecular Biological Methods for Bacillus (Ed. 
C.R. Harwood and S.M. Cutting), 1990. John Wiley & Sons Ltd. Chichester, UK). 

5 

Site scan libraries were constructed using a degenerate oligo containing one NNK 
codon, AAiiere K stands for G or T and N stands for A, C, G, or T. This means that a set 
of clones constructed from an amplification reaction using an NNK primer (also 
10 known as *a site scan library*) contains in principle 32 unique codons (4x4x2 = 32 
combination options). Assuming no bias due, the nimiber of clones that one needs to 
pick to have a 95% chance of picking every one of the 32 codons at least once is 95. 
This can be calculated using the following formula 

15 Formula 1; n = { log (1-c) } / { log (1-f) } 

Where n is the nimiber of clones, c is the fraction value of the confidence interval, for 
example the 95% confidence interval has a value of 0.95 and the 99% confidence 
interval has a firaction value of 0.99, and f is the frequency with which each individual 

20 codon occurs, which for an NNK primer is 1/32 or 0.03125. Solving the formula for n 
gives 94.36 or 95 clones. If a 95% confidence interval is deemed to be too low, or if 
one is unable to avoid bias in one or more steps of the library construction process, one 
can decide to assay or sequence more clones. For example, in formula 1, if n is set to 
384, f to 1/32 or 0.03125 then the confidence interval c is much larger than 99%. Even 

25 if 60% of the clones contain the same mutation or the wild type codon, then 363 clones 
will give a 99% confidence of obtaining all 32 codons. From this one can conclude 
that, 384 clones will have a 99% confidence of containing each of the 32 codons at 
least once. 

30 A colony PCR was performed (a PCR reaction on a bacterial colony or on a bacterial 
liquid culture to amplify a firagment from a plasmid inside a bacterium, and 
subsequently sequencing that part of the j&agment which has been mutagenised is an 
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established procedure. Colony PGR can be routinely performed for sets of 96 due to 
the availability of prefabricated material (also known as kits) for colony PGR, 
sequencing, and sequence purijBcation. This entire procedure is offered as a service by 
several commercial companies such as AGOWA GmbH, Glienicker weg 185, D- 
12489 Berlin, Germany. 

After analysing the 96 sequence reactions, the individual clones were selected 
representing one for each codon that is available in the set of 96 sequences. 
Subsequently, for each of the clones representing the mutants, 5 ml of LB broth 
(Casein enzymatic digest, 10 g/1; low-sodium Yeast extract, 5 g/1; Sodium Chloride, 5 
g/l; Inert tableting aids, 2 g/1) supplemented with 50 mg/1 kanamycin, was inoculated 
and incubated at 33 °C for 6 hours at 205 rpm. 0.7 ml of this culture was used to 
inoculate 50 ml of SAS substrate (K2HPO4, 10 g/1; MOPS (3-morpholinopropane 
sulfonic acid), 40 g/1; Sodium Chloride, 5 g/l; Antifoam (Sin 260), 5 drops/1; Soy flour 
degreased, 20 g/l; Biospringer 106 (100 % dw YE), 20 gA) supplemented with 50 mg/1 
kanamycin and a solution of high maltose starch hydrolysates (60 g/1). Incubation was 
continued for 40 hours at 33 °C and 180 rpm before the culture supernatant was 
separated by centrifiigation at 19000 rpm for 30 min. The supematant was transferred 
into a clean tube and directly used for the assay. 

Selection of winners method 2; library screening followed by sequencing of the 
winners 

Although one could choose to sequence 384 clones, one may also assay them and 
select improved variants before sequencing. 

A number of issues should be considered when such a number of samples are 
screened. Without being exhaustive, although it is possible to select variants with 
altered activity on one substrate, the difference in expression level between 384 
cultures can be substantial even if one uses a 384 well microtiter plate, resulting in a 
high background. Therefore, measuring two activities and selecting winners based on a 
change in ratio is a preferred method. To illustrate, if two actiAaties have a certain ratio 
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R then regardless of the absolute amount of enzyme present, the ratio between the two 
activities will always be R. A change in the R value indicates a mutation that changed 
one activity relative to the second activity. 

5 Figure 60 shows a data set obtained from the site scan library. The clones are all tested 
for activity towards phosphatidyl choline (PC) and digalactosyl diglyceride (DGDG). 
All clones, which can be mutated or not, that exhibit no change in the R value will lie 
on a straight line with a certain margin of error. Disregarding these clones three groups 
of interest appear in Figure 61 . 

10 

Section 1 in Figure 61 contains all the clones that have a significantly higher R than 
the wild-type (not mutated) but lower overall DGDG activity. Section 2 contains those 
clones that have both a higher R value and a higher DGDG activity than the wild type- 
Section 3 contains clones that do not have a higher R value, but that do have a 
1 5 significantly higher DGDG or PC activity. 

If one is interested in variants with an increased activity towards DGDG then section 2 
contains the most interesting variants and section 3 contains variants of interest as 
well. The variants in Section 3 which show a large increase in hydrolytic activity may 
20 be accompanied by a decrease in transferase activity. 

One thing is worth noticing, if a specificity determining residue is hit, most of the 20 
possible amino acids could yield a very different R value. However, if the library 
contains a large bias towards a single amino acid (for example 60% is Tyrosine) then 
25 all those variants will still lie on a straight line. 

EXAMPLE 4 : Assavs for PC and DGDG activity in a 384 well microtitcr plate 

Start material 
30 • EM media 

• Plate with transformants 

• Plate with wild type 
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• 384 plates 

• colony picker 

• Waco NEFA-C kit 

• PC and DGDG solutions in a 384 plate 

5 

Part 1 — picking colonies 

• Pick colonies into a 384 plate fiUed with EM medinm 

• Skip 4 wells and inoculate those with colonies containing the non-mutated 
backbone 

1 0 ♦ Grow o/n at 30*^0, 200 rpm shaking speed 

Part 2 - Incubation on substrate 

• Centrifuge the o/n grown plates; 2500 rpm, 20 min 

• Transfer 10 jul supernatant from each well to 2 empty 384 plates 

15 • Add 5 ^1 12.5 raM DGDG to one of the plates, add 5 ^1 12.5 mM PC to the 

other plate 

• Incubate both plates 2 hrs at 37°C, shake at start to mix then stop the shaking 

• Continue with the NEFA C procedure 


20 Part 3 - NEFA-C procedure 

• Add 10 |il A solution 

• Incubate 10 min 37*'C, 300 rpm 

• Add 20 ^1 B solution 

• Incubate 10 min 37°C, 300 rpm 
25 • Read the plate at 550 nm 


Substrate composition — in mM 
25mMPC ellerDQDG 
lOmMCaCla 
30 60 mM Triton X 100 
15mMNaN3 
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20 mM Briton Robinson pH 5.0 

EXAMPLE 5 Selected variants 

5 Determination of enzyme activity 

To determine the eiLzymatic activity towards various substrates 4 \il enzyme solution 
was incubated with 11 jil substrate for 60 minutes at 37°C. Subsequently the amoimt 
of free fatty acids was determined using the WACO NEFA-C kit. To the 15 |xl 
enzyme+substrate mix 75 fxl NEFA solution A was added and incubated for 15 
10 minutes at 37^C. Subsequently 150 ^il NEFA solution B was added and incubated for 
15 minutes. Subsequently the optical density (OD) of the sample was measured at 550 
nm. 

As a control, from each variant 4 |il enzyme solution was incubated with 1 1 ^il HEPES 
15 bujfFer for 60 min at 37°C. Subsequently the amoimt of free fatty acids was determined 
as described above. The OD values of this control sample was deducted from the 
observed OD on each substrate to obtain a corrected activity. 

Four different substrates were used, the composition was in general 30 mg lipid, 4.75 
20 ml 50 mM HEPES buffer pH 7, 42.5 ^1 0.6 M CaC12, 200 \il 10% Triton X-100 
H202-free. The 30 mg lipid was either phosphatidyl choline (PC), PC with cholesterol 
in a 9 to 1 ratio, digalactosyl diglyceride (DGDG), or DGDG with cholesterol in a 9 to 
1 ratio. 

25 Selection of improved variants 

Variants with improved activity towards PC 

Those variants that showed an increase in the OD relative to the wild type enzyme 
when incubated on PC were selected as variants with improved phosphoUpase activity. 
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Variants with improved activity towards DGDG 

Those variants that showed an increase in the OD relative to the wild type enzyme 
when incubated on DGDG were selected as variants with improved activity towards 
DGDG. 

5 

Variants with improved specificity towards DGDG 

The specificity towards DGDG is the ratio between the activity towards DGDG and 
the activity towards phosphatidylchoUne (PQ. Those variants that showed a higher 
ratio between DGDG and PC than the wild type were selected as variants with 
10 improved specificity towards DGDG. 


Variants with improved transferase activity wdth PC as the acyl donor 

The difference in the amount of free fatty acids formed when one incubates an enzyme 

on PC and on PC with cholesterol is an indication of the amount of transferase activity 

15 relative to the amount of hydrolytic activity. Transferase activity will not cause the 
formation of free fatty acids. The transferase preference is the ratio between the free 
fatty acids formed when PC is used as a substrate and the free fatty acids formed when 
PC with cholesterol is used as a substrate. Those variants that show an increase in the 
transferase preference and show a higher than wild type activity towards PC were 

20 selected as having improved transferase activity. 


Variants with improved transferase activity with DGDG as the-acyl donor 

The difference in the amount of free fatty acids formed when one incubates an enzyme 

on DGDG and on DGDG vnth cholesterol is an indication of the amount of transferase 

25 activity relative to the amount of hydrolytic activity. Transferase activity will not cause 
the formation office fatty acids. The transferase preference is flie ratio between the 
free fatty acids fonned when DGDG is used as a subshate and the free fatty acids 
formed when DGDG with cholesterol is used as a substrate. Those variants that show 
an increase in the transferase preference and show a higher than wild type activity 

30 towards DGDG were selected as having improved transferase activity. 
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Selected variants 

For each of the four selection criteria above a number of variants were selected. 
The 'Svild type" enzyme in this example is A, salmonicida (SEQ K) No. 28). 
Variants with unproved activity towards PC: 



PC 

Thr3Asn 

158,0 

Hir3Gln 

151,5 

Thr3Lys 

141,5 

Thr3Arg 

133,0 

Glu309A]a 

106,0 

Thr3Pro 

101,5 

Thr3Met 

96,0 

wild-type 

86.5 


Variants with improved activity .towards DGDG: 



DGDG 


Glnl82Asp 

66,5 


Glu309AIa 

60 


Tyr230Thr 

59 


Tyr230Gly 

57,5 


Tyr230Gly 

51 


Thr3Gln 

44,5 


wild-type 

43,5 


Variants with improved specificity towards DGE 


R 

DGDG/PC 

PC 

DGDG 

Glnl82Asp 

1,02 

65,5 

66,5 

Tyr230Gly 

0,79 

72,5 

57,5 

Tyr230Gly 

0,78 

65,0 

51,0 
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lyriiOlnr 

0,75 

78,5 

59,0 

iyrziUVal 

0,71 

58,0 

At rv 

41,0 

Aspl57Cys 

0,69 

AO f\ 

48,0 

33,0 

vjluiUyrro 

A CO 

/J ,5 

4z,5 

Glu309AIa 

0,57 

106,0 

60,0 

GlySlSUe 

0,53 

69,5 

36,5 

Tyr230Arg 

0,50 

63,5 

32,0 

Tyr230Met 

0,50 

64,5 

32,5 

wild-type 

0,50 

86,5 

43,5 


Variants with improved transferase activity with PC as the acyl donor: 



Rpc+Cho/pc 

PC 

PC+Cho 

Thr3Lys 

0,54 

142 

76 

Thr3Arg 

0,55 

133 

73 

Thr3Gln 

0,63 

152 

96 

Thr3Asn 

0,64 

158 

101 

Thr3Pro 

0,67 

102 

68 

Thr3Met 

0,78 

96 

75 

wild-type 

0,83 

87 

72 


Variants with improved transferase activity with DGDG as the acyl donor: 



RoGIXj+Cho/DG 
DO 

DGDG 

Tyr230Thr 

1,10 

59 

Glnl82Asp 

1,39 

67 

Tyr230Gly 

1,55 

58 

Glu309Ala 

1,78 

60 

wild-type 

1,78 

44 


5 
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EXAMPLE 6: Transferase assay Phospholipidxholesterol 

5 

Phospholipid can be replaced by DGDG to provide a transferase assay from a 
galacolipid. Other acceptors for example, glycerol, glucose, hydroxy acids, proteins or 
maltose can also be used in the same assay. 

10 300 mg Phosphatidylcholine (Avanti #441601):Cholesterol(Sigma C8503) 9:1 is 
scaled in a Wheaton glass. 10 ml 50 mM HEPES buffer pH 7.0 is added and stirring at 
40 °C disperses the substrate 

0,5 nod substrate is transferred to a 4 ml vial and placed in a heating block at 40 °C. 
0.050 ml transferase solution is added, also a control with 0.050 ml water is analysed 
15 in the same way. The reaction mixture is agitated for 4 hours at 40 °C. The sample is 
then frozen and lyophilised and analysed by GLC. 
Calculation: 

From the GLC analysis the content of free fatty acids and cholesterol ester is 

calculated. 

20 The enzymatic activity is calculated as: 
% Transferase activity^ 

A % cholesterol ester/(Mv sterol ester) x 100 , 

25 A % cholesterol ester/(Mv cholesterol ester) +A % fatty acid/(Mv fatty acid) 

% Hydrolyse activity= 


30 A % fatty acid/(Mv fatty acid) x 1 00 , 

A % cholesterol ester/(Mv cholesterol ester) +A % fatty acid/(Mv fatty acid) 
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Ratio Transferase/Hydrolyse transferase activity/% Hydrolyse activity 
Where: 

A % cholesterol ester = % cholesterol ester(sample)-% cholesterol ester(control). 
A % fatty acid = % fatty acid(saniple) - % fatty acid(controI). 

Transferase assay Galactolipidicholesterol. 

300 mg Digalactosyldiglyceride (DGDG) (purity >95 galactolipids, the DGDG used is 
purified from wheat lipid. DGDG from Sigma D4651 is also suitable for 
use):Cholesterol(Sigma) 9:1 is scaled in a Wheaton glass. 10 ml 50 mM HEPES buffer 
pH 7.0 is added and stirring at 40 disperses the substrate. 

0,5 ml substrate is transferred to a 4 ml vial and placed in a heating block at 40 °C. 
0.050 ml transferase solution is added, also a control with 0.050 ml water is analysed 
in the same way. The reaction mixture is agitated for 4 hours at 40 ""C. The sample is 
then frozen and lyophilised and analysed by GLC. 
Calculation: 

From the GLC analysis the content of free fatty acids and cholesterol ester is 

calculated. 

The enzymatic activity is calculated as: 

% Transferase acti vity= A % cholesterol ester/rMv sterol ester) x 1 00 , 

A % cholesterol ester/(Mv cholesterol ester) +A % fatty acid/(Mv fatty acid) 

% Hydrolyse activity= A % fatty acid/(Mv fattv acid) x 100 , 

A % cholesterol ester/(Mv cholesterol ester) +A % fatty acid/(Mv fatty acid) 

Ratio Transferase/Hydrolyse =% transferase activity/% Hydrolyse activity 
Where: 


wo 2D05/066347 


PCT/IB2004/0a4378 


125 

A % cholesterol ester = % cholesterol ester(sample)-% cholesterol ester(control). 
A % fatty acid = % fatty acid(sample) - % fatty acid(control) 

5 EXAMPLE 7: Variants of a lipid acvltransferase for Aeromonas hvdrophila fSEO ID 
No. 26) 

Mutations were introduced using the QuikChange™ Mxilti-Site Directed Mutagenesis 
kit from Stfatagene, La JoUa, CA92037, USA following the instructions provided by 
10 Stratagene. 

Variants at Tyr256 showed an increased activity towards phospholipids. 

Variants at Tyr256 and Tyr260 showed an increased activity towards galactolipids. 

15 

Variants at Tyr265 showed an increased transferase activity with galactolipids as the 
acyl donor. 

The numbers indicate positions on the following sequence: An enzyme from 
20 Aeromonas fjydrophila the amino acid sequence of which is shown as SEQ ID No. 26. 
The nucleotide sequence is as shown as SEQ ID No. 27. 


EXAMPLE 8: Screening of mutants of glvcerophospholipidxholesterol 
acvltransferase GCAT from Aeromonas salmonicida . 

25 Mutants from point mutations of glycerophospholipidxholesterol acyltransferase 
GCAT from Aeromonas salmonicida were screened for transferase activity using 
phosphatidylcholine or digalactosyldiglyceride as donor and cholesterol as acceptor 
with the aim to select mutant with better activity towards digalactocyldiglyceride than 
phosphatidylcholine. 


30 
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GCAT mutants were screened for transferase activity using 
digalactosyIdiglyceride(DG) and phosphatidyIcholine(PC) as donor and cholesterol as 
acceptor. 

5 DG (purity >95% digalactosyldiglyceride (DGDG iised is purified fi-om wheat lipid. 
DGDG from Sigma D4651 is also suitable for use jfrom Signma D4651),) and 
cholesterol (Sigma C8503) was scaled in the ratio 9 :1 and dissolved in chloroform 
and evaporated to dryness. 

10 The substate was prepared by dispersing of 3% DGiCholesterol 9:1 in 50 mM HEPES 
buffer pH 7. 

0,250 ml substrate was transferred to a 3 ml glass with screw lid. 0,025 ml supematant 
from fermentation of mutant GCAT was added an incubated at 40 °C for 2 hours. A 
15 reference sample with water instead of enzyme was also prepared. Heating the reaction 
mixture in a boiling water bath for 1 0 minutes stopped the enzyme reaction. 

2 ml 99% ethanol was added and submitted to cholesterol analysis as well as free fatty 
acid analysis. 

20 

Cholesterol assay. 

100 jil substrate containing: 1.4 U/ml Cholesterol oxidase( SERVA Electrophoresis 
GmbH cat. No 17109), 0,4 mg/ml ABTS (Sigma A-1888), 6 U/ml Peroxidase (Sigma 
6782) m 0,1 M TRIS,HC1 buffer pH 6.6 + 0,5% Triton X 100(Sigma X-100) was 
25 incubated at 37°C for 5 minutes. 5^1 cholesterol sample was added and mixed. The 
reaction mixture was incubated for fiirther 5 minutes and OD 405imi measured. The 
content of cholesterol was calculated from the analyses of standard solutions of 
cholesterol containing 0,4mg/ml, 0,3mg/ml , 0,2Qmg/ml, 0,lmg/ml, 0,05 mg/ml and 0 
mg/ml. 

30 

Free fatty acid assay. 
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Free fatty acids in the sample was measured using a NEFA C kit ( WAKO Chemdcals 
GmbH) 

75 ^il NEFA reagent A was incubated for 10 minutes at 37 ""C. 15 nl enzyme sample 
was added and mixed. The reaction mixture was incubated forlO minutes. 150 ^1 
NEFA reagent B was added, mixed and incubated for further 10 minutes and OD 540 
nm was measured. Free fatty acid was calculated from standard solutions of 0,4, 0.3, 
0.2, 0.1, 0.05 and 0 mM fatty acid. 

Transferase assay using phosphatidylcholine as donor was measured in the same way, 
but using phosphatidylcholine(Avanti #441601) instead of DG (DGDG). 

Transferase activity was expressed as % cholesterol esterified calculated from the 
difference in free cholesterol in the reference sample and free cholesterol in the 
enzyme sample. 

Hydrolyfic activity was expressed as % free fatty acid produced calculated from 
the difference in free fatty acid in the enzyme sample and free fatty acid in the 
reference sample. 

The relative Transferase activity agaiost DG and PC was calculated as % Tdg/Tpc- 

The transferase activity Tdg relative to the hydrolytic activity Hdg on DG for the 
mutants were calculated: 

0.1 X %TDG 7386 = 0.1 x %TDG x 280 
%HDG/280 %HDG x386 

Where 386 = MW for cholesterol and 280 = MW for fatty acid. 

Mutants with Tdg >50% and Tdc/Tpc > 3 and 0.1 x%Tnn 7386 > 2.5 

%Hdg7280 


were selected as improved mutants. 
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The data obtained firom the above example can be analysed via statistics to identify 
and prioritise key sites and/or specUBc amino acid substitutions which provide the 
desired activity profile, such as increased ration of Tdg as cx)mpared to Tpc- For 
5 example, the follovraig robust modeling is proposed: 

The information regarding Tpc and Tdg is carried by the censored responses max(0, 
Tpc) and max(0, TDG)-The objective of the study is to identify settings determining Tjx} 
>=Tpc, based on the scores for hi(l+ TDG)-ln(l+ Tpc) with positive values as 
preference, both in absolute scale and in relative scale compared to a control (native). 
The preferred settings are identified based on a binary response (Event, Non-Event), 
where Event is defined as a preferred response in relation to the scores. A binomial 
GLIM model with complementary log-log link, based on the empirical data structure 
without prior information included, analyses the binary responses. See the following 
reference for details of how to perform the statistical analysis: proc LOGISTIC in 
SAS Institute hic, SAS/STAT® User's Guide, Version 6, 4JEd, Vol.2, Gary, NC: SAS 
Institute Inc., 1989. 


Variants with 

Variants with 

Variants with enhanced Tdg/ Hdg 

increased Tdg/Tpc 

oihanced DGDG 

activity 


transferase activity 



Tdg 



N80P,G,orE 

Y179E,R,N,Q 

K22E,K 

S310Q,HorS 

N215G 

G40L 

S3 E, A, G, K, M, Y, 

L210 D, H, R, E. A, Q, P, N, K, G, R, 

N87R,D,E,M 

R,P,N,T,orQ 

T,W,I,VorS 

Y117 A, N, E, H, 

-318 R, S, E, H. Q, 

N80G 

T 

NorD 

Y30L 

Q182K,T 

N215 L, G, V, R or 

N87G 

M209K,M 

Y 

H180I,T 

L210N 

K82S 

M209Y 

R211 G 


R211D,TorG 

N215H 


S18G,MorT 

Y230I 


G40RorM 

-318 Y,HorS 


N88W 

N215H 


N87 C,D,R,Eor G 

L210D,QorT 



E309S,QorR 




10 


15 
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Ml 80 K or O 



IN OU JIN, Iv or jlJ 






M ^ W V A P 

IVl, O, W, V, /\, Iv, 









js^ or o 



V 1 IZ V-/ 






iVl, /\ Or IV 



V U K, Ji, rl or A 



V{ K or IN 



JVZZ IS 






1 1 /y V 



MZUy Lr, K, M 



L211 G,Q,KorD 



Y230 V 



S310P 



Y179R 



H180T 



Q289TorD 



G40Q,LorV 



N88W 



N87RorD 




EXAMPLE 9: Selection of improved mutants of glycerophospholipidxholesterol 
acyltransferase GCAT &om Aeromonas salmonicidcL 

5 

The "parent" enzyme in this example is^. salmonicida (SEQ E) No. 28). 

32 positions of GCAT from Aeromonas salmonicida (230 Tyr, 182 Lys, 3 Thr, 157 
Asp, 310 Thr, 318 Gly, 309 Glu, 17 Leu, 111 Trp, 117 Tyr, 179 Tyr, 118 Leu, 215 
10 Asn, 22 Lys, 290 Val, 289 Gin, 285 Met, 18 Ser, 23 Met, 180 His, 284 Lys, 181 Asn, 
209 Met, 210 Leu, 21 1 Arg, 40 Gly, 81 Pro,l 12 Val, 80 Asn, 82 Lys, 88 Asn, 87 Asn 
were screened according to the experimental outline in Example 8. 
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Based on the results from the screening and the three selection criteria the following 
mutants listed in the table 1 were selected. 


Table 1 


Position 

Amino acid 

T,PC 

T,DG 

TJDG/T,PC 

H,PC 

H,DG 

210 

GLN 

10,3 

59,7 

5,9 

0,0 

0,0 

215 

GLY 

15,1 

55,8 

4,5 

3,1 

1,0 

215 

LEU 

19,4 

51,9 

3,3 

4,3 

1,1 

215 

TYR 

21,3 

68,0 

3,9 

4,3 

1,8 

215 

ARG 

16,2 

62,1 

4,7 

5,5 

2,1 

215 

VAL 

14,7 

61,6 

5,2 

3,5 

1,7 

215 

fflS 

5,7 

50,1 

10,9 

4,2 

1,3 

215 

ASN 

9,4 

47,4 

6,2 

4,0 

1.2 


EXAMPLE 10: Selection of specific amino acid regions of interest for mutation of the 
glycerophospholipid:cholesterol acyltransferase GCAT from Aeromonas salmonicida. 

10 

From the pfam alignment (alignment 2; FIGURE 56) and overlay of the P10480 model 
and IIVN all amino acids in regions surrounding the glycerol molecule in the active 
site of IIVN were selected and used for defining regions of specific interest (loops). 
(Numbers refer to the amino acids in the PI 0480 mature sequence (SEQ ID IMo. 2)): 

15 

Thr 20- Arg 41 (Loop 1, LI) 
ne 77- Leu 89 (Loop 2, L2) 
Leu 1 18 - Asp 127 (Loop 3, L3) 
Gly 146- Val 176 (Loop 4, L4) 
20 Glu208-Tip287(Loop5,L5) 

The intervening regions (IVR) were named accordingly: 
Ala 1- Asp 19(IVR1) 
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Phe42-Lys76(IVR2) 
Asp90-Tyrll7(IVR3) 
Ala 128-Asnl45(IVR4) 
Serl77-Ala207(IVR5) 
5 Asp 288 -His 317 (IVR6) 


The following table summarizes the allocation of preferred positions for mutation of 
the glycerophospholipidrcholesterol acyltransferase GCAT from Aeromonas 
salmonicida. The results are based on experimental outlines as set out in Example 8- 
10 10. 



P10480 
amino acid 
positions 
(SEQ ID No 
2) 

Preferable 
sites to 
produce 
variants 
with 

increased 
Tdg/Tpc 

lOA 

Preferred 
regions for 
methods of the 
invention 

rvRi 

1-19 


L17,S18 


Loopl 

20-41 

K22, G40, 
Y30 

K22,M23, G40 

Loopl 

IVR2 

42-76 




Loop2 

77-89 

N80, N87, 
N88 

N80, P81, K82, 
N87,N88 

Loop 2 

IVR3 

90-117 

Y117, 
V112, 
Will, 
A114, 

Will, VI 12, 
A114, Y117 

IVR3 

IVR3 & lOA 
from active site. 

LoopS 

118-127 


LI 18 


IVR4 

128-145 




Loop4 

146-176 


PI 56 


IVR5 

177-207 

N181, 
QI82, 
HI 80, 
Y179 

Y179, H180, 
N181 

IVR5 

IVR5 & lOA 
from active site. 

LoopS 

208-287 

M209, 

L210, 

R211, 

N215, 

Y230 

M209, L210, 
R211, N215, 
K284, M285, 
Q289, V290 

Loop 5 

Loop 5 & lOA 
from active site. 

IVR6 

288-317 

Q 289, 


IVR6 
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V290, E309 
S310,-318 




All publications mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described methods and system 
of the present invention will be apparent to those skilled in the art without departing 
5 from the scope and spirit of the present invention. Although the present invention has 
been described in connection with specific preferred embodunents, it shoiild be 
understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out 
the invention which are obvious to those skilled m biochemistry and biotechnology or 
1 0 related fields are intended to be within the scope of the following claims. 
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CLAIMS 

1. A method of producing a variant glycolipid acyltransferase enzyme comprising: (a) 
selecting a parent enzyme which is a lipid acyltransferase enzyme characterised in that 
the enzyme comprises the amino acid sequence motif GDSX, wherein X is one or 

5 more of the following amino acid residues L, A, V, I, F, Y, H, Q, T, N, M or S; (b) 
modifying one or more amino acids to produce a variant lipid acyltransferase; (c) 
testing the variant lipid acyltransferase for transferase activity, and optionally 
hydrolytic activity, on a galactolipid substrate, and optionally a phosphoUpid substrate 
and/or optionally a triglyceride substrate; (d) selecting a variant enzyme with an 
10 enhanced activity towards galactolipids compared with the parent enzyme; and 
optionally (e) preparing a quantity of the variant enzyme. 

2. A method according to claim 1 wherein the method comprises testing the variant 
lipid acyltransferase for: 

15 (i) transferase activity from a galactolipid substrate, and 

(ii) transferase activity from a phospholipids substrate; and 
selecting a variant enzyme, which when compared with the parent enzyme, has an 
enhanced ratio of transferase activity from galactolipids compared with phospholipids. 

20 3. A method according to claim 2 wherein the ratio of transferase activity from 
galactolipids compared with phospholipids is at least 3. 

4. A method according to any one of the proceedings claims comprising testing the 
variant lipid acyltransferase for: 
25 (a) transferase activity from a galactolipid substrate, and 

(b) hydrolytic activity on a galactolipid substrate; and 
selecting a variant enzyme with an enhanced ratio of transferase activity from 
galactolipids compared with its hydrolytic activity on glycolipids, compared with the 
parent enzyme. 
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5. A method according to claim 4 wherien the enhanced ratio of transferase activity on 
galactoUpids compared to hydrolytic activity on galactolipids is at least 1.5. 

6. A method according to any one of the preceding claims wherein one or more of the 
5 following amino acid residues identified by alignment with SEQ ID No. 2 is modijBed 

compared with a parent sequence at any one or more of the amino acid residues 
defined in set 2 or set 4 or set 6 or set 7. 

7. A method according to any one of the preceding claims wherein the parent enz5mae 
10 comprises an amino acid sequence as shown as SEQ ID No. 2, SEQ ID No. 3, SEQ ED 

No. 4, SEQ ID No. 5, SEQ ED No. 6, SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 1 6, 
SEQ ID No. 18, SEQ ID No. 20, SEQ ID No. 22, SEQ ID No. 24, SEQ ID No. 26, 
SEQ ID No. 28, SEQ ID No. 30, SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 36, 
SEQ ID No. 37, SEQ ID No. 39, SEQ ID No. 41, SEQ ID No. 43 or SEQ ID No. 45, 
15 or an amino acid sequence which has at least 70% identity therewith. 

8. A method according to any one of the preceding claims wherein the parent enzyme 
is an enzyme which comprises the amino acid sequence shown as SEQ ID No. 2 
and/or SEQ ID No. 28. 

20 

9. A method according to any one of the preceding claims v^erein Preferably, the X of 
the GDSX motif is L. 

10. A method according to any one of the preceding claims wherein the method further 
25 comprises one or more of the following steps: structural homology mapping or 

sequence homology alignment. 

11. A method according to claim 10 wherein tiie structural homology mapping 
comprises one or more of the following steps: 

30 a) aUgning a parent sequence with a stmctural model (IFVN.PDB) shown in Figure 
52; 
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b) selecting one or more amino acid residue within a lOA sphere centred on the 
central carbon atom of the glycerol molecule in the active site (see Figure 53); and 

c) modifying one or more amino acids selected in accordance with step (b) in said 
parent sequence. 

5 

12. A method according to claim 10 wherein the structural homology mapping 
comprises one or more of the following steps: 

a) aligning a parent sequence with a structural model (IFVN.PDB) shown in Figure 
52; 

10 b) selecting one or more amino acids within a lOA sphere centred on the central 

carbon atom of the glycerol molecule in the active site (see Figure 53); 
c) determining if one or more amino acid residues selected in accordance with step 

(b) are highly conserved (particularly are active site residues and/or part of the 

GDSx motif and/or part of the GANDY motif); and 
15 d) modifying one or more amino acids selected in accordance with step (b), excluding 

conserved regions identified in accordance with step (c) in said parent sequence. 

13. A method according to claim 10 wherein the sequence homology alignment 
comprises one or more of the following steps: 

20 i) selecting a first parent lipid acyltransferase; 

ii) identifying a second related lipid acyltransferase having a desirable activity; 

iii) aligning said first parent lipid acyltransferase and the second related lipid 
acyltransferase; 

iv) identifying amino acid residues that differ between the two sequences; and 

25 v) modifying one or more of the amino acid residues identified in accordance with 
step (iv) in said parent lipid acyltransferase. 

14. A method according to claim 10 wherein the sequence homology alignment may 
comprise one or more of the following steps: 

30 i) selecting a first parent lipid acyltransferase; 

ii) identifying a second related lipid acyltransferase having a desirable activity; 
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iii) aligning said first parent lipid acyltransferase and the second related lipid 
acyltransferase; 

iv) identifying amino acid residues that dijffer between the two sequences; 

v) determining if one or more amino acid residues selected in accordance with 
5 step (iv) are highly conserved (particularly are active site residues and/or part 

of the GDSx motif and/or part of tiie GANDY motif); and 

vi) modifying one or more of the amino acid residues identified in accordance with 
step (iv) excluding conserved regions identified in accordance with step (v) in 
said parent sequence. 

10 

15. A method according to any one of the preceding claims comprising modifying one 
or more of the following amino acid residues: -318, N215, L210, S310, E309, H180, 
N80, VI 12, Y30X (where X is selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, 
S, T, V, or W), V290, Q 289, K22, G40, Y179, M209, L211, K22, P81, N87, Y117, 

15 N181, Y230X (where X is selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, 
V,orW),Q182. 

16. A method according claim 15 comprising modifying one or more of the following 
amino acid residues: -318, N215, L210, E309, HI 80, N80. 

20 

17. A method according to any one of claims 1-14 comprising modifying one or more 
of the following amino acid residues: -318, N215, L210, S310, E309, H180, N80, 
VI 12, Y30X (where X is selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, 
V, or W), V290, Q 289, K22, G40, Y179, M209, L211, K22, P81, N87, Y117, N181, 

25 Y230X (where X is selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or 
W),Q182, S3, K82. 


18. A method according to any one of claims 1-14 comprising modifying one or more 
of the following amino acid residues: Y230X (where X is selected from A, C, D, E, G, 
30 H, I, K, L, M, N, P, Q, R, S, T, V, or AV), S3 10, Y179, H180, Q289, G40, N88, N87. 
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19. A method according to any one of claims 1-14 comprising modifying one or more 
of the following amino acid residues: Y179, N215, L210, N80, Y30X (where X is 
selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), N87. 

5 20. A method according to any one of claims 1-14 comprising modifying one or more 
of the following amino acid residues: Y179, N215, L210, N80, Y30X (where X is 
specificaUy selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), 
N87, H180, M209, R21 1, S18X (where X is specificaUy selected from A, C, D, E, F, 
H, I, K, L, M, N, P, Q, R, T, W or Y), G40, N88, N87. 

10 

21. A variant glycolipid acyltransferase enzyme characterised in that the enzyme 
comprises the amino acid sequence motif GDSX, wherein X is one or more of the 
following amino acid residues L, A, V, I, F, Y, H, Q, T, N, M or S, wherein the variant 
has an enhanced activity towards galactolipids compared with the parent enzyme and 

15 wherein the variant enzyme comprises one or more amino acid modifications 
compared with a parent sequence at any one or more of the amino acid residues 
defined in set 2 or set 4 or set 6 or set 7. 

22. A variant glycolipid acyltransferase enzyme according to claim 21 wherein the 
20 variant enzyme comprises an amino acid sequence, which amino acid sequence is 

shown as SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, SEQ ID No. 6, 
SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 16, SEQ ID No. 18, SEQ ID No. 20, 
SEQ ID No. 22, SEQ ID No. 24, SEQ ID No. 26, SEQ ID No. 28, SEQ ID No. 30, 
SEQ ID No. 33, SEQ ID No. 34, SEQ ID No. 36, SEQ ID No. 37, SEQ ID No. 39, 
25 SEQ ID No. 41, SEQ ID No. 43 or SEQ ID No. 45 except for one or more amino acid 
modifications at any one or more of the amino acid residues defined in set 2 or set 4 or 
set 6 or set 7. 

23. A variant glycolipid acyl transferase according to claim 21 or claim 22 wherein the 
30 enzyme comprises one or more amino acid modifications at any one or more of the 

following amino acids: -318, N215, L210, S310, E309, H180, N80, V112, Y30X 
(where X is selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), 
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V290, Q 289, K22, G40, Y179, M209, L211, K22, P81, N87, Y117, N181, Y230X 
(where X is selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T. V, or W), 
V290,N87,Q182. 

5 24. A variant glycolipid acyl transferase according to claim 23 wherein the enzyme 
comprises one or more amino acid modification at any one or more of the following 
amino acid residues: -318,N215,L210,E309, H180,N80. 

25. A variant glycolipid acyl transferase according to claim 21 or claim 22 -wiierein the 
10 en2yme comprises one or more amino acid modifications at any one or more of the 

following amino acids: -318, N215, L210, S310, E309, H180, N80, V112, Y30X 
(where X is selected &om A, C, D, E, G, H, 1, K, L, M, N, P, Q, R, S, T, V, or W), 
V290, Q 289, K22, G40, Y179, M209, L211, K22, P81, N87, Y117, N181, Y230X 
(where X is selected from A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), 
15 V290,N87, Q182, S3, S310, K82, E309. 

26. A variant glycolipid acyl transferase according to claim 21 or claim 22 wherein the 
enzyme comprises one or more amino acid modifications at any one or more of the 
folloAwng amino acids: Y230X (where X is selected from A, C, D, E, G, H, I, K, L, M, 

20 N, P, Q, R, S, T, V, or W), S310, Y179, H180, Q289, G40, N88, N87. 

27. A variant glycolipid acyl transferase according to claim 21 or claim 22 wherein the 
enzyme comprises one or more amino acid modifications at any one or more of the 
following amino acids: Y179, N215, L210, N80, Y30X (where X is selected from A, 

25 C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W), N87. 

28. A variant glycolipid acyl transferase according to claim 21 or claim 22 wherein 
the enzyme comprises one or more amino acid modifications at any one or more of the 
following ammo acids: Y179, N215, L210, N80, Y30X (where X is specificaUy 

30 selected from A. C, D, E, G. H, I, K, L, M, N, P, Q, R, S, T, V, or W), N87, HI 80. 
M209, R2 1 1 , S 1 8X (where X is specificaUy selected from A, C, D, E, F, H, 1, K, L, M, 
N. P, Q, R, T, W or Y), G40, N88, N87. 
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29. A variant glycolipid acyltransferase enzyme according to any one of claims 21-28 
wherein the variant enzyme has an enhanced ratio of activity on galactolipids to either 
phospholipids and/or triglycerides when compared with the parent enzyme. 

5 

30. A variant glycohpid acyltransferase according to any one of claims 21-29 wherein 
the variant enzyme has a higher galactolipid transferase activity compared with its 
galactolipid hydrolytic activity compared with the parent enzyme. 

10 31. A variant glycolipid acyltransferase enzyme according to any one of claims 21-30 
wherein the variant enzyme is an enzyme which comprises an amino acid sequence, 
which amino acid sequence is shown as SEQ ED No. 2 or SEQ ID No. 28 except for 
one or more amino acid modifications at any one or more of the amino acid residues 
defined in set 2 or set 4 or set 6 or set 7. 

15 

32. Use of a variant glycolipid acyltransferase enzyme according to any one of claims 
21-31 or obtained by the method according to any one of claims 1-20 in a substrate for 
preparing a lyso-glycolipid, for example digalactosyl monoglyceride (DGMG) or 
monogalactosyl monoglyceride (MGMG) by treatment of a glycolipid (e.g. 
20 digalactosyl diglyceride (DGDG) or monogalactosyl diglyceride (MGDG)) with the 
variant lipolytic enzyme .according to the present invention or obtained by a method 
according to the present invention to produce the partial hydrolysis product, i.e. the 
lyso-glycolipid. 

25 33. Use according to claim 32 wherein the substmte is a foodstuff. 

34. A method of preparing a foodstuff the method comprising adding a variant 
glycohpid acyltransferase enzyme according to any one of claims 21-31 or obtained by 
the method according to any one of claims 1-20 to one or more ingredients of the 
30 foodstuff 
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35. A method of preparing a baked product from a dough, the method comprising 
adding a variant glycolipid acyltransferase enzyme according to any one of claims 21- 
31 or obtained by the method according to any one of claiunis 1-20 to the dough. 

5 36. Use of a variant glycolipid acyltransferase enzyme according to any one of claims 
21-31 or obtained by the method according to any one of claims 1-20 in a process of 
treating egg or egg-based products to produce lysophospholipids. 

37. A process of enzymatic degumming of vegetable or edible oils, comprising treating 
10 the edible or vegetable oil with a variant glycolipid acyltransferase enzyme according 
to any one of claims 21-31 or obtained by the method according to any one of claims 
1-20 so as to hydrolyse a major part of the polar lipids (e.g. phospholipid and/or 
glycolipid). 

15 38. Use of a variant glycoUpid acyltransferase enzyme according to any one of claims 
21-31 or obtained by the method according to any one of claims 1-20 in a process for 
reducing the content of a phospholipid in an edible oil, comprising treating the oil with 
said variant lipolytic enzyme so as to hydrolyse a major part of the phospholipid, and 
separating an aqueous phase containing the hydrolysed phospholipid from the oil. 

20 

39. Use of a variant glycolipid acyltransferase enzyme according to any one of claims 
21-31 or obtained by the method according to any one of claims 1-20 in the 
bioconversion of polar lipids (preferably glycolipids) to make high value products, 
such as carbohydrate esters and/or protein esters and/or protein subunit esters and/or a 

25 hydroxy acid ester. 

40. An immobilised variant glycolipid acyltransferase enzyme according to any one of 
claims 21-31 or obtained by the method according to any one of claims 1-20. 

30 41. A variant glycoUpid acyltransferase enzyme generally as described herein with 
reference to the figures and examples. 
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42, A method generally as described herein with reference to the figures and examples. 
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Figure 1 


SEQIDNo. 1 

1 ivafGCSlTd geayygdsdg ggwgagladr 
61 ivDalvallF laqslglpnL pPYlsgdflr 
121 vlelrqalgl Iqellrllpv Idakspdlvt 
181 kdnlrqli)cr Lrsnngarii vlitlvilnl 
241 eavadfneal relaislcled qlrJtdglpdv 
301 ettJcaCCGyG gryNynrvCG naglcnvtak 
361 1 

Figure 2 


Ltallrlrar prgvdvfnrg isGrtsdGrl 
GAHFAsagAt Ilptsgpfli QvqFkdfksq 
imiGtNDlit saffgpkste sdrnvsvpef 
gplGClPlkl alalassknv dasgclerln 
kgadvpyvDl ysifqdldgi qnpsayvyGF 
aCnpssylls flfwDgfHps ekGykavAea 


SEQIDNo. 2 

ADSRPAFSRIVMFGDSLSDTGKMYSKMRGYLPSSPPYYEGRFSNGPN/WLEQLTNEF 

PGLTIANEAEGGPTAVAYNKISWNPKYQVINNLDYEVTQFLQKDSFKPDDLVILWVGA 

NDYLAYGWNTEQDAKRVRDAISDAANRMVLNGAKEILLFNLPDLGQNPSARSQKW 

EAASHVSAYHNQLLLNLARQIAPTGMVKLFEIDKQFAEMLRDPQNFGLSDQRNACY 

GGSYVWKPFASRSASTDSQLSAFNPQERLAIAGNPLIAQAVASPMAARSASTLNCE 

GKMFWDQVHPTTWHAALSEPAATFIESQYEFLAH 


Figure 3 


SEQIDNo. 3 

1 lakkwfvcllg lialtvqaad trpafsrivm fgdslsdtgk myskmrgylp ssppyyegrf 

61 sngpvwleql tkqfpgltia neaeggatav aynkiswnpk yqvynnldye vtqflqkdsf 

121 kpddlvilwv gandylaygw nteqdakrvr daisdaanrm vlngakqill fnlpdlgqnp 

IBl sarsqkwea vshvsayhnk lllnlarqla ptgmvklfei dkqfaemlrd pqnfglsdve 

241 npcydggyvw kpfatrsvst drqlsafspq erlaiagnpl laqavaspma rrsasplnce 
301 gkrafwdqvhp ttwhaalse raatfietqy eflahg 


Figure 4 


SEQIDNo. 4 

1 mpkpalrrvm tatvaavgtl algltdatah aapaqatptl dyvalgdsys agsgvlpvdp 
61 anllclrsta nyphviadtt gaxltdvtcg aaqtadftra qypgvapqld algtgtdlvt 
121 Itiggndnst finaitacgt agvlsggkgs pckdrhgtsf ddeieantyp alkeallgvr 
181 arapharvaa Igypwitpat adpscflklp laagdvpylr aiqahlndav rraaeetgat 
241 yvdfsgvsdg hdaceapgtr wiepllfghs Ivpvhpnalg errroaehtTiui vlgld 
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SEQIDNo.5 

1 iDpIcpalrrvm tatvaavgtl algltdatah aapaqatptl dyvalgdsys agsgvlpvdp 
61 anllclrsta nyphviadtt garltdvtcg aaqtadftra qypgvapqld algtgtdlvt 
121 Itiggndnst finaitacgt agvlsggkgs pckdxhgtsf ddeieantyp alkeallgvr 
181 arapharvaa Igypwitpat adpscflklp laagdvpylr aiqahlndav rraaeetgat 
241 yvdfsgvsdg hdaceapgtr wiepllfghs Ivpvhpnalg ernnaehtind vlgld 


Figure 6 


SEQIDNo.6 


1 indyeJcfllfg dsitef afnt rpiedgkdqy algaalvney trkmdilqrg f kgytsrwal 
61 kilpeilkhe snivmatifl gandacsagp qsvplpefid nirqmvslmk syhirpiiig 
121 pglvdrekwe kekseeialg yfrtnenfai ysdalaklan eekvpfvaln kafqqeggda 
181 wqqlltdglh fsgfcgykifh dellkvletf ypqyhpkniDq yklkdwrdvl ddgsnims 
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Figure 7 


Alignment of pfam00657.6 consensus sequence with P10480 

*->ivaf GDSlTdg eayygdsdgggwgagladrL 

iv+fGDSl+d+++ ++ ++ +++++++ +++s+g w ++1 + + 
P10480 28 lVMFGDSLSDTgkinyskmrgylpssppYyEGRFSNGPVWLEQ:.TNEF 74 


P104B0 


tall. . rlrarprgvdvfnrgisGrtsdGrlivDalvallFlaqslglpn 


+1 + ++++++++ +n+ + 
75 PGLTiaNEAEGGPTAVAYNBaSWKPK- 


100 


P10480 101 


LpPYLsgdflrGANFAsagAtllptsgpfliQvqFkdfksqvlelrqalg 

++ ++ 

YQVINN 106 


P10480 


llqellrllpvldakspdlvtimiGtNDlitsaf fgpkstesdrnvsvpe 
l++e+ ++1 +4+ k+ dlv++++G+HIH ++ ++ ++++++ 

107 LDYEVTQFLQKDSFKPDDLVILWVGAITOY LAYGWNTEQDAKR 148 


P104B0 


fkdnlrqliJtrLrsnngariivlitlvilnlgplGClPlltlalalasskn 
++d ++++++ r+ nga+ ++++nl+ 1G+ P+ 
149 VRDAISDAANRMV-LNGAK EILLFNLPDLGQNPS 


181 


vdasgclerlneavadfnealrelaiskledqlrkdglpdvkgadvpyvD 
+e + ++a++n++l +la +ql+++g+++++++d ++++ 
P10480 182 ARSQKWEAASHVSAYHNQLLLNLA RQLAPT01VKI.FEIDKQFAE 226 

lysif qdldgiqnpsayv.y GFe . ttkaCCGyGgr . yNyn . rv.CG 

+ +q+++ + + +a+++++ +++ +++a+++++++ +N+++r+ ++ 
P10480 227 MLRDPQNFGLSDQBNACYgGsyvwKPFaSRSASTDSQLSaFNPQeRLalA 276 

nag .I.e. nvtakaC . npssyll . sf IfwDgf HpsekGykavAeaK-* 
+++ 1 + ++++a++ +S+ ++++++ fwD++Hp+ ++a+ e 
P10480 277 GNPlLaQaVASPMAArSASTLNCeGKMFWDQfVHPTTVVHAALSEPA 322 


Alignment of pfam00657.6 consensus sequence with AAG09804 

*->ivafGDSlTdg eayygdsdgggwgagladrL 

iv+fGDSl+d+++ ++ ++ +++++++ +++s+g w ++1 + + 
AAG09804 28 IVMFGDSLSDTgkmystanrgylpssppYYEGRFSNGPVWLEQLTKQF 74 


AAG09804 


tallrlrarprgvdvfnrgisGrtsdGrlivDalvallFlaqslglpnLp 
+g+++ n + +G+t 
75 PGLTIANEAEGGAT 


88 


AAG09804 


AAG09804 


AAG09B04 


AAG09804 


AAG09804 


AAG09804 


89 


PYLsgdflrGANFAsagAtllptsgpfliQvqFkdfksqvlelrqa. . . . 

++++ + ++++ + 

AVAYNKISWNpkyq 102 


. . IgllqellrllpvldakspdlvtimiGtNDlitsaf f gpkstesdrnv 
++l++e+ ++1 +++ k+ dlv++++G+NtH- ++ ++ ++ 

103 vyNNI»DYEVTQFLQKDSFKPDDI.VILWVGANDY lAYGHNTEQ 144 


svpefkdnlrqlikrLrsnngariivlitlvilnlgplGClPlklalala 
++++++d -l-+++++r+ nga+ +++++nl+ 1G+ P+ 
145 DAKRVRDAISDAANRMV-LKGAK QILLFNLPDLGQNPS 


181 


ssknvdasgclerlneavadfnealrelaiskledqlrkdglpdvkgadv 
++4+ +e + ++a++n+-H +la +ql+++g+++++++d 
182 TVRSQKWEAVSHVSAYHNKLLLNIA RQLAPTGHVKLFEIDK 222 

pyvDlysif qdldgiqnpsayv . y GFe . tt kaCCGyGgr . yNyn . r 

+++++ +q+++ + ++ +++++ +++ t++ +++ +++ + +++r 
223 QFAEMLRDPQHFGLSDVENPCYdGgyvwKPFaTRSVSTDRQLSaFSPQeR 272 

V. CGnag, 1 . c .nvtakaC . npssyll . sf IfwDgfHpsekGykavAeal 
+ +++++ 1 + ++++a++ +s ++++++ fwD++Hp+ +-fa+ e+ 
273 I^IAtaJPlLaOaVASPHaRrSASPI^CeGKMFWDOVHPTTVVHAALSERA 322 
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AAG09804 

Alignment of pfain00657.6 consensus sequence with NP_631558 

*->ivafGDSlTdgeayygcisdgggwgagladr3jtallrlrarprgvdvf 
+va-Hn)S ++g +g + + + + ++ + 

HP 631558 42 YVMiGDSYSAG SGVLPVDPAKL LCLRSTANYPHV 75 


NP 631558 


nrgisGrtsdGrlivD. a . 1 .vallFlaqslglpnLpPYLsgdf IrGTiNF 
+ ++G++ D + + + 
76 lADTTGAR LTDvTcGaAQ 93 


NP 631558 


94 


AsagAtllptsgpfliQvqFkdfksqvlelrqalgllqellrllpvldak 

+++ ++ + ++ +++ 
TADFTRAQYPGVAPQI*DALGT 114 


spdlvtimiGtNDl itsaf f gpkstesdmvsvp 

+ dlvt+ iG+ND ++ + + +++ ++++k ++ + +++ 
»P_631558 115 GTDLVTLTIGGKDNstfinaitacgtagvlSGGKGSPCKDKHGTSFDDEI 164 

efkdn. .IrqlikrLrs.nngariivlitlvilnlg plG 

e +++ 1++++ +r-hf+ +ar+ +1 ++i+++ +++ + + G 
NP_631558 165 EANTYpaLKEALl»GVRArAPHARVAALGyPWITPATadpscflklplAAG 214 

ClPlklalalassknvdasgclerlneavadfnealrelaiskledqlrk 
P+ 1+ ++a n a+r a 
NP 631558 215 DVPY LRAIQAHLNDAVRRAA 234 


NP 631558 


dglpdvkgadvpyvDlysifqdldgiqnpsayvyGFettkaCCGyGgryN 
++ + +yvD+ ++ 

235 EETGATYVDFSGVSDG 


250 


NP 631558 251 


ynrvCGnaglcnvtakaC .npssyll . sf IfwDgf . . . HpsekGykavAe 
++aC+ p +++ + If + + + Hp++ G +++Ae 
HDACeAPGTRWIePLLFGHSLvpvHPNALGERBMAE 2B6 


NP 631558 


al<- 
+ 

287 HT 


288 


Alignment of pfam00657.6 consensus sequence with CAC42140 

♦->ivafGIJSlTdgeayygdsdgggwgagladrLtallrlrarprgvdvf 
+va+GDS ++g +g + + + + ++ + 

CAC42140 42 YVALGDSYSAG SGVLPVDPANL LCLRSTANYPHV 75 


CAC42140 


nrgisGrtsdGrlivD. a . 1 . vallFlaqslglpnLpPYLsgdf IrGANF 
+ ++G-H- D + + + 
76 lADTTGAR LTDvTcGaAQ 93 


CAC42140 


AsagAtllptsgpfliQvqFkdfksqvlelrqalgllqellrllpvldak 

+++ -H- + ++ +++ 
94 TADFTRAQYPGVAPQLDALGT 114 


CAC42140 


SpdlvtimiGtNDl itsaf f gpkste sdmvsvp 

+ dlvt+ iG+ND ++ + + +++ ++ + +k ++ + +++ 
115 GTDLVTLTIGGNDNstfinaitacgtagvlSGGKGSPCKDRHGTSFDDEI 164 


CAC42140 


efkdn . . IrqlikrLrs . nngariivlitlvilnlg plG 

e +++ 1++++ +r+++ +ar+ +1 ++i+++ +++ + + G 
165 ETaOTpalJCEJOiGVRArAPHARVAALGYPWITPATadpscflklplAAG 214 


CAC42140 


ClPlklalalassknvdasgclerlneavadfnealrelaiskledqlrk 
p+ 1+ ++a n a+r a 
215 DVpy LRAIQAH1J3DAVRRAA 234 


CAC42140 


dglpdvkgadvpyvDlysifqdldgiqnpsayvyGFettkaCCGyGgryN 
++ + +yvl>+ ++ 

235 EETGATYVDFSGVSDG 250 
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ynrvCGnaglcnvtakaC -npssyll . sf 1 f vDgf — HpsekGyJcavAe 

++aC+ p +++ + If + + + Hp++ G +++Ae 
CAC42140 251 HDACeAPGTRHIePLLFGHSLvpvHPNALGERRMAE 286 


CAC42140 


al<- 
+ 

287 HT 


2B8 


Alignment of pfam00657.6 consensus sequence with P41734 

*->ivafGDSlTdg. . * .eayygdsdgggwgagladrLtallrlrarprg 
++fGDS+T+ +++ + + d+ ga+1 + + +r+ 
P4n34 6 FLLFGDSITEFafntRPIEDGKDQYALGAALVNEY TRK 43 


P41734 


vdvfnrgisGrtsdGrlivDalvallFlaqslglpnLpPYLsgdflrGAN 
+d+ rg+4<5+t 

44 MDILQRGFKGYT 55 


FAsagAtllptsgpfliQvqFkdfksqvlelrqalgllqellrllpvlda 

+r+al++l+e+l+ + 
P41734 56 SRMALKILPEILKH E 70 

ItspdlvtiniiGtNDlit s af f gpks t e sdmvsvpe f kdnlrqli krLr s 
+ + ti++G+NI><- ++ +++ v++pef+dn+rq++++++s 

P41734 71 SNIVMATIFLGANDA CSAGPQSVPLPEFIDNIRCJIVSLMKS 111 

nngariivlitlvilnlgplGClPlklalalassknvdasgclerlneav 
++++ii++++lv ++ ++ k ++ + + r+ne + 

P41734 112 YHIRPIIIGPGLVDREKW EKEKSEEIALGYFRTNENF 148 

adfnealrelaiskledqlrkdglpdvkgadvpyvDlysifqdldgiqnp 
a + al +la ++ +vp+v l+++fq+ +g++++ 

P41734 149 AIYSDAliAKLA NEEKVPFVAINKAFQQEGGDAWQ 182 

s a yvyGFe 1 1 kaCCGyGg ryNynrvCGn a glcnvt a ka Cnps s yll s f 1 f 
+ 1+ 
P41734 183 Q ■■ lil* 185 


P41734 


wDgfHpsekGykavAeaK-* 
Dg+H+s kGyk+++++l 
186 TDGLHFSGKGYKIFHDEL 203 
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Figure 8 


A. sal 

1 

MKKWWCIJ.GLIALTVQAADTRPAFSRIVMit3C[^DTGKr^ 60 

A.hyd ' 

1 

MKKWFVCIJCTVALTVQAADSRPAFSiaVMElSffsi^OT 60 

A. 

sal 

61 

SHGPWI£QI,TKQFPGLTIANEAEGGATAVAYHKISWNPKYQVINNIJ)YEVTQFI^ 120 
+t + 

SNGPVWLEQLTNEFPGLTl ANEAEGGPTAVAYNKI SWNPKYC3fVINHLDYEVTQFLQKDSF 12 0 

A. 

hyd 

61 

A. 

sal 

121 

KPDDLVILWVGANgYIAYGSNTEQDAKRVPDAISDAAHKMVLNGAKQI 180 

A. 

hyd 

121 

+ 

KPDDLVILWGAI§YIAYGWNTEQDAKRVRDAISDAANK^m*NGAKEILLFOT*PDI^ 180 

A. 

sal 

181 

SARSQKVVEAVSHVSAYHNKLUmARQLAPTGbaS^FEIDKQFAEMIJ^ 240 

A.hyd 

181 

SARSQKWEAASHVSAYHNQIJIJnJiRQLAPTGtiVKLFEIDKQFAEM^ 240 

A- 
A. 

sal 
hyd 

241 
241 

NPCYDGGYVWKPFATRSVSTDRQI»SAFSPQERLAIAGMPlJxAQAVASPMARRSASPI*SC^ 300 

+ +++ ++++ + + 
NACYGGSYVWKPFASRSASTDSQLSAFNPQERLAIAGNPLLAQAVASPMAARSASTLUCE 300 

A. 

sal 

301 

GKMPWDQ^mPTTVVHAALSERAATFlETQYEFLAH 335 
+ + 

A. 

hyd 

301 

GKMFWDQraPTTWHAALSEPAATFIESQYEFLAH 335 
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Figure 9 


(SEQIDNo.7) 

1 ATGAAAAAAT GGTTTGTGTG TTTATTGGGA TTGGTCGCGC TGACAGTTCA GGCAGCCGAC 

61 AGCCGTCCCG CCTTCTCCCG GATCGTGATG TTTGGCGACA GCCTCTCCGA TACCGGCAAG 

121 ATGTACAGCA AGATGCGCGG TTACCTCCCC TCCAGCCCCC CCTACTATGA GGGCCGCTTC 

181 TCCAACGGGC CCGTCTGGCT GGAGCAGCTG ACCAACGAGT TCCCGGGCCT GACCATAGCC 

241 AACGAGGCGG 7UVGGCGGACC GACCGCCGTG GCTTACAACA AGATCTCCTG Q^TCCCAAG 

301 TATCAGGTCA TCAACAACCT GGACTACGAG GTCACCCAGT TCCTGCAAAA AGACAGCTTC 

361 AAGCCGGACG ATCTGGTGAT CCTCTGGGTC GGCGCCAACG ACTATCTGGC CTATGGCTGG 

421 AACACAGAGC AGGATGCCAA GCGGGTGCGC GACGCCATCA GCGATGCGGC CAACCGCATG 

481 GTGCTGAACG GCGCCAAGGA GATACTGCTG TTCAACCTGC CGGATCTGGG CCAGAACCCC 

541 TCGGCCCGCA GCCAGAAGGT GGTCGAGGCG GCCAGCCATG TCTCCGCCTA CCACAACCAG 

601 CTGCTGCTGA ACCTGGCACG CCAGCTGGCT CCCACCGGCA TGGTGAAGCT GTTCGAGATC 

661 GACAAGCAGT TTGCCGAGAT GCTGCGTGAT CCGCAGAACT TCGGCCTGAG CGACCAGAGG 

721 AACGCCTGCT ACGGTGGCAG CTATGTATGG TIAGCCGTTTG CCTCCCGCAG CGCCAGCACC 

781 GACAGCCAGC TCTCCGCCTT CAACCCGCAG GAGCGCCTCG CCATCGCCGG CAACCCGCTG 

841 CTGGCCCAGG CCGTCGCCAG CCCCATGGCT GCCCGCAGCG CCAGCACCCT CAACTGTGAG 

901 GGCAAGATGT TCTGGGATCA GGTCCACCCC ACCACTGTCG TGCACGCCGC CCTGAGCGAG 

961 CCCGCCGCCA CCTTCATCGA GAGCCAGTAC GAGTTCCTCG CCCAC 
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Figure 10 


(SEQIDNo.8) 

1 ATGAAAAAAT GGTTTGTTTG TTTATTGGGG TTGATCGCGC TGACAGTTCA GGCAGCCGAC 

61 ACTCGCCCCG CCTTCTCCCG aATCGTGATG TTCGGCGACA GCCTCTCCGA TACCGGCAAA 

121 ATGTACAGCA AGATGCGCGG TTACCTCCCC TCCAGCCCGC CCTACTATGA GGGCCGTTTC 

IBl TCCAACGGAC CCGTCTGGCT GGAGCAGCTG ACCAAGCAGT TCCCGGGTCT GACCATCGCC 

241 AACGMGCGG AAGGCGGTGC CACTGCCGTG GCTTACAACA AGATCTCCTG GAATCCCAAG 

301 TATCAGGTCT ACAACAACCT GGACTACGAG GXCACCCAGT TCTTGCAGAA A6ACAGCTTC 

361 AAGCCGGACG ATCTGGTGAT CCTCTGGGTC GGTGCCAATG ACTATCTGGC ATATGGCTGG 

421 AATACGGAGC AGGATGCCAA GCGAGTTCGC GATGCCATCA GCGATGCGGC CAACCGCATG 

481 GTACTGAACG GTGCCAAGCA GATACTGCTG TTCAACCTGC CGGATCTGGG CCAGAACCCG 

541 TCAGCCCGCA GTCAGAAGGT GGTCGAGGCG GTCAGCCATG TCTCCGCCTA TCACAACAAG 

601 CTGCTGCTGA ACCTGGCACG CCAGCTGGCC CCCACCGGCA TGGTJW^GCT GTTCGAGATC 

661 GACAAGCAAT . TTGCCGAGAT GCTGCGTGAT CCGCAGAACT TCGGCCTGAG CGACGTCGAG 

721 AACCCCTGCT ACGACGGCGG CTATGTGTGG AAGCCGTTTG CCACCCGCAG CGTCAGCACC 

781 GACCGCCAGC TCTCCGCCTT CAGTCCGCAG GAACGCCTCG CCATCGCCGG CAACCCGCTG 

B41 CTGGCACAGG CCGTTGCCAG TCCTATGGCC CGCCGCAGCG CCAGCCCCCT CAACTGTGAG 

901 GGCAAGATGT TCTGGGATCA GGTACACCCG ACCACTGTCG TGCACGCAGC CCTGAGCGAG 

961 CGCGCCGCCA CCTTCATCGA GACCCAGTAC GAGTTCCTCG CCCACGGATG A 
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Figure 11 


(SEQIDNo. 9) 

1 ATGCCGAAGC CTGCCCTTCG CCGTGTCATG ACCGCGACAG TCGCCGCCGT CGGCACGCTC 

61 GCCCTCGGCC TCACCGACGC CACCGCCCAC GCCGCGCCCG CCCAGGCCAC TCCGACCCTG 

121 GACTAC6TCG CCCTCGGCGA CAGCTACAGC GCCGGCTCCG GCGTCCTGCC CGTCGACCCC 

181 GCCAACCTGC TCTGTCTGCG CTCGACGGCC AACTACCCCC ACGTCATCGC GGACACGACG 

241 GGCGCCCGCC TCACGGACGT CACCTGCGGC GCCGCGCAGA CCGCCGACTT CACGCGGGCC 

301 CAGTACCCGG GCGTCGCACC CCAGTTGGAC GCGCTCGGCA CCGGCACGGA CCTGGTCACG 

361 CTCACCATCG GCGGCAACGA CAACAGCACC TTCATCAACG CCATCACGGC CTGCGGCACG 

421 GCGGGTGTCC TCAGCGGCGG CAAGGGCAGC CCCTGCAAGG ACAGGCACGG CACCTCCTTC 

481 GACGACGAGA TCGAGGCCAA CACGTACCCC GCGCTCAAGG AGGCGCTGCT CGGCGTCCGC 

541 GCCAGGGCTC CCCACGCCAG GGTGGCGGCT CTCGGCTACC C6TGGATCAC CCCGGCCACC 

601 GCCGRCCCGT CCTGCTTCCT GAAGCTCCCC CTCGCCGCCG GTGACGTGCC CTACCTGCGG 

661 GCCATCCAGG CACACCTCAA CGACGCGGTC CGGCGGGCCG CCGAGGAGAC CGGAGCCACC 

721 TACGTGGACT TCTCCGGGGT GTCCGACGGC CACGACGCCT GCGAGGCCCC C6GCACCCGC 

781 TGGATCGAAC CGCTGCTCTT CGGGCACAGC CTCGTTCCCG TCCACCCCAA CGCCCTGGGC 

841 6AGCGGCGCA TGGCCGAGCA CACGATGGAC GTCCTCGGCC TGGACTGA 
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Figure 12 
(SEQroNo. 10) 


a TCAGTCCAGG CCGAGGACGT 

61 GGGGTGGACG GGAACGAGGC 

121 GGCCTCGCAG GCGTCGTGGC 

181 CTCCTCGGCG GCCCGCCGGA 

241 CACSTCaCCG GCGGCGAGGG 

301 GATCCACGGG TAGCCGRGAG 

361 CAGCGCCTCC TTGAGCGCGG 

421 GTGCCTGTCC TTGCAGGGGC 

481 CGTGATGGCG TTGATGTIAGG 

541 CGTGCCGGTG CCGAGCGCGT 

601 GTCGGCGGTC TGCGCGGCGC 

661 GATjSACGTGG GGGTAGTTGG 

721 CAGGACGCCG GAGCCGGCGC 

781 GGCCTGGGCG GGCGCGGCGT 

841 GGCGGCGACT GTCGCGGTCA 
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CCATCGTGTG CTCGGCCATG 
TGTGCCCGAA GAGCAGCGGT 
CGTCGGACAC CCCGGAGAAG 
CCGCGTCGTT GAGGTGTGCC 
GGAGCTTCAG GAAGCAGGAC 
CCGCCACCCT GGCGTGGGGA 
GGTACGTGTT GGCCTCGATC 
TGCCCTTGCC GCCGCTGAGG 
TGCTGTTGTC GTTGCCGCCG 
CCAACTGGGG TGCGACGCCC 
CGCAGGT6AC GTCCGTGAGG 
CCGTCGAGCG CAGACAGAGC 
TGTAGCTGTC GCCGAGGGCG 
6GGCGGTGGC- GTCGGTGAGG 
TGACACGGCG AAGGGCAGGC 


CGCCGCTCGC CCAGGGCGTT 
TCGATCCAGC GGGTGCCGGG 
TCCACGTAGG TGGCTCCGGT 
TGGATGGCCC GCAGGTAGGG 
GGGTCGGCGG TGGCCGGGGT 
GCCCTGGCGC GGACGCCGAG 
TCGTCGTCGA AGGAGGTGCC 
ACACCCGCCG TGCCGCAGGC 
ATGGTGAGCG TGACCAGGTC 
GGGTACTGGG CCCGCGTGAA 
CGGGCGCCCG TCGTGTCCGC 
AGGTTGGCGG GGTCGACGGG 
ACGTAGTCCA GGGTCGGAGT 
CCGAGGGCGA GCGTGCCGAC 
TTCGGCAT 
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Figure 13 
(SEQIDNo. 11) 


1 ATGGATTACG AGAAGTTTCT GTTATTTGGG GATTCCATTA CTGAATTTGC TTTTAATACT 

61 AGGCCCATTG AAGATGGCAA AGATCAGTAT GCTCTTGGAG CCGCATTAGT CAACGAATAT 

121 ACGAGAAAAA TGGATATTCT TCAAAGAGGG TTCAAAGGGT ACACTTCTAG ATGGGCGTTG 

181 AAAATACTTC CTGAGATTTT AAAGCATGAA TCCAATATT6 TCATGGCCAC AATATTTTTG 

241 GGTGCCAACG ATGCATGCTC AGCAGGTCCC CAAAGTGTCC CCCTCCCCGA ATTTATCGAT 

301 AATATTCGTC AAATGGTATC TTTGATGAAG TCTTACCATA TCCGTCCTAT TATAATAGGA 

361 CCGGGGCTAG TAGATAGAGA GAAGTGGGAA AAAGAAAAAT CTGAAGAAAT AGCTCTCGGA 

421 TACTTCCGTA CCAACGAGAA CTTTGCCATT TATTCCGATG CCTTAGCAAA ACTAGCCAAT 

481 GAGGAAAAAG TTCCCTTCGT GGCTTTGAAT AAGGCGTTTC AACAGGAAGG TGGTGATGCT 

541 TGGCAACAAC TGCTAACAGA TGGACTGCAC TTTTCCGGAA AAGGGTACAA AATTTTTCAT 

601 GACGAATTAT TGAAGGTCAT TGAGACATTC TACCCCCAAT ATCATCCCAA AAACATGCAG 

661 TACAAACTGA AAGATTGGAG AGATGTGCTA GATGATGGAT CTAACATAAT GTCTTGA 
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Figure 14 
(SEQIDNo. 12) 


10 20 30 40 50 60 

t t 1 I 1 I 

MNLRQWMGAA TAAIALGLAA CGGGGTDQSG NPNVAKVQRM WPGDSI^DI GTYTPVAQAV 

70 80 90 100 110 120 

I I I I t I 

GGGKFTTNPG PIWAETVAAQ LGVTLTPAVM GYATSVCZNCP KAGCFDYAQG GSRVTDPiJGI 

130 140 150 160 170 180 

I I I I I I 

GHHGGAGALT YPVQQQLANF YAASNOTFNG NNDWFVLAG SNDIFFIfTTA AATSGSGVTP 

190 200 210 220 230 240 

1 I I I 1 1 

AIATAQVQQA ATDLVGYVKD MITUtGATQVY VFNLPDSSLT PDGVASGTTG QALLHALVGT 

250 260 270 280 290 300 

i I I I t I 

FNTTLQSGLA GTSARIIDm AQLTAAIQNG ASFGFAHTSA RACDATKINA LVPSAGGSSL 
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^310 320 330 340 

! I I » 

FCSANTLVAS GADQSYLFAD GVHPTTAGHR LIASNVIAKL LADNVAH 
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Figure 15 
(SEQIDNo. 13) 


atgaacctgc gtcaatggat gggcgccgcc acggctgccc ttgccttggg cttggccgcg 60 

tgcgggggcg gtgggaccga ccagagcggc aatcccaatg tcgccaaggt gcagcgcatg 120 

gtggtgttcg gcgacagcct gagcgatatc ggcacctaca cccccgtcgc gcaggcggtg 180 

ggcggcggca agttcaccac caacccgggc ccgatctggg ccgagaccgt ggccgcgcaa 240 

ctgggcgtga cgctcacgcc ggcggtgatg ggctacgcca cctccgtgca gaattgcccc 300 

aaggccggct get teg acta tgcgcagggc ggctcgcgcg tgaccgatcc gaacggcatc 360 

ggccacaacg gcggcgcggg ggcgctgacc tacccggttc agcagcagct cgccaacttc 420 

tacgcggcca gcaacaacac attcaacggc aataacgatg tcgtcttcgt gctggccggc 480 

agcaacgaca ttttcttctg gaccactgcg gcggccacca gcggctccgg cgtgacgccc 540 

gccattgcca cggcccaggt gcagcaggcc gcgacggacc tggtcggcta tgtcaaggac 600 

atgatcgcca agggtgcgac gcaggtctac gtgttcaacc tgcccgacag cagcctgacg 660 

ccggacggcg tggcaagcgg cacgaccggc caggcgctgc tgcacgcgct ggtgggcacg 720 

ttcaacacga cgctgcaaag cgggctggcc ggcacctcgg cgcgcatcat cgacttcaac 780 

gcacaactga ccgcggcgat ccagaatggc gcctcgttcg gcttcgccaa caccagcgcc 840 

cgggcctgcg acgccaccaa gatcaatgcc ctggtgccga gcgccggcgg cagctcgctg 900 

ttctgctcgg ccaacacgct ggtggcttcc ggtgcggacc agagctacct gttcgccgac 960 

ggcgtgcacc cgaccacggc cggccatcgc ctgatcgcca gcaacgtgct ggcgcgcctg 1020 

ctggcggata acgtcgcgca ctga 1044 
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1 migsyvavgd sftegvgdpg pdgafvgwad rlavlladrr pegdftytnl avrgrlldqi 
61 vaeqyprwg lapdlvsfaa ggndiirpgt dpdevaerfe lavaaltaaa gtvlvttgfd 
121 trgvpvlkhl rgJciatyngh vraiadrygc pvldlwslrs vqdrrawdad rlhlspeght 
181 rvalxagqal glrvpadpdq pwpplpprgt Idvrrddvhw areylvpwig rrlrgessgd 
241 hvtakgtlsp daiktriaav a 


Figure 17 (SEQ ID No. 15) 

1 gtgatcgggt cgtacgtggc ggtgggggac agcttcaccg agggcgtcgg cgaccccggc 
61 cccgacgggg cgttcgtcgg ctgggccgac cggctcgccg tactgctcgc ggaccggcgc 
121 cccgagggcg acttcacgta cacgaacctc gccgtgcgcg gcaggctcct cgaccagatc 
181 gtggcggaac aggtcccgcg ggtcgtcgga ctcgcgcccg acctcgtctc gttcgcggcg 
241 ggcggcaacg acatcatccg gcccggcacc gatcccgacg aggtcgccga gcggttcgag 
301 ctggcggtgg ccgcgctgac cgccgcggcc ggaaccgtcc tggtgaccac cgggttcgac 
361 acccgggggg tgcccgtcct caagcacctg cgcggcaaga tcgccacgta caacgggcac 
421 gtccgcgcca tcgccgaccg ctacggctgc ccggtgctcg acctgtggtc gctgcggagc 
481 gtccaggacc gcagggcgtg ggacgccgac cggctgcacc tgtcgccgga ggggcacacc 
541 cgggtggcgc tgcgcgcggg gcaggccctg ggcctgcgcg tcccggccga ccctgaccag 
601 ccctggccgc ccctgccgcc gcgcggcacg ctcgacgtcc ggcgcgacga cgtgcactgg 
661 gcgcgcgagt acctggtgcc gtggatcggg cgccggctgc ggggcgagtc gtcgggcgac 
721 cacgtgacgg ccaaggggac gctgtcgccg gacgccatca agacgcggat cgccgcggtg 
781 gcctga 
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Figure 18 
(SEQIDNo. 16) 

1 mqtnpaytsl vavgdsfteg msdllpdgsy rgwadllatr maarspgfry anlavrgkli 
61 gqivdeqvdv aaamgadvit Ivgglndtlr pkcdmarvrd lltqaverla phceqlvlmr 
121 spgrqgpvle rfrprroealf aviddlagrh gavwdlyga qsladprmwd vdrlhltaeg 
181 hrrvaeavwq slghepedpe whapipatpp pgwv-trrtad vrfarqhllp wigrrltgrs 
241 sgdglpakrp dllpyedpar 


Figure 19 (SEQIDNo. 17) 


1 atgcagacga accccgcgta caccagtctc gtcgccgtcg gcgactcctt caccgagggc 

61 atgtcggacc tgctgcccga cggctcctac cgtggctggg ccgacctcct cgccacccgg 

121 atggcggccc gctcccccgg cttccggtac gccaacctgg cggtgcgcgg gaagctgatc 

181 ggacagatcg tcgacgagca ggtggacgtg gccgccgcca tgggagccga cgtgatcacg 

241 ctggtcggcg ggctcaacga cacgctgcgg cccaagtgcg acatggcccg ggtgcgggac 

301 ctgctgaccc aggccgtgga acggctcgcc ccgcactgcg agcagctggt gctgatgcgc 

361 agtcccggtc gccagggtcc ggtgctggag cgcttccggc cccgcatgga ggccctgttc 

421 gccgtgatcg acgacctggc cgggcggcac ggcgccgtgg tcgtcgacct gtacggggcc 

481 cagtcgctgg ccgaccctcg gatgtgggac gtggaccggc tgcacctgac cgccgagggc 

541 caccgccggg tcgcggaggc ggtgtggcag tcgctcggcc acgagcccga ggaccccgag 

601 tggcacgcgc cgatcccggc gacgccgccg ccggggtggg tgacgcgcag gaccgcggac 
661 gtccggttcg cccggcagca cctgctgccc tggataggcc gcaggctgac cgggcgctcg 
721 tccggggacg gcctgccggc caagcgcccg gacctgctgc cctacgagga ccccgcacgg 
781 tga 
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1 latrgrdggag apptkhrall aaivtlivad. 
61 apastgavnrg awatapaaae pgt.ettglag 
121 vthasialaa gpdtaaalad tmrrltfggs 
181 yspipsgpvt yhpqarqtsy ladgdrtadv 
241 afgdsitdga rsqsdanhrw tdvlaarlhe 
301 padnpsglsr fqrdvlertn vkawwlgv 
361 Irwgatitp fggyggytea retmrqevue 
421 ydsgdhlhpg dXgyamgav idlaalkgaa 


saaiyagasa ddgsrdhalg aggrlprgda 
rsvrnwhts vggtgaritl snlygqsplt 
arviipaggq vmsdtarlai pyganvlvtt 
tavayttptp ywryltaldv Isheadgtw 
aagdgrdtpr yswnegisg nrlltsrpgr 
ndvlnspela drdailtglr tlvdraharg 
eirsgrvfdt wdfdkalrd pydprnarsd 
pvka 


Figure 21 (SEQIDNo. 19) 


1 

61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 


atgacccggg 
gcggcgatcg 
gacgacggca 
gcccccgcgt 
ccgggcaccg 
gtcggcggca 
gtcacacacg 
accatgcgcc 
gtgatgagcg 
tactccccca 
ctggccgacg 
tactggcgct 
gcgttcggcg 
accgacgtcc 
tacagcgtcg 
ccggccgaca 
gtcaaggccg 
gaccgcgacg 
ctgcgggtcg 
cgcgagacga 
gtcgtcgact 
tacgacagcg 
atcgacctgg 


gtcgtgacgg 
tcaccctgat 
gcagggacca 
ccaccggtgc 
agacgaccgg 
ccggcgcgcg 
cctcgatcgc 
ggctcacctt 
acaccgcccg 
tcccgtccgg 
gcgaccgcac 
acctgaccgc 
actccatcac 
tcgccgcacg 
tcaacgaggg 
acccgagcgg 
tcgtcgtcgt 
ccatcctgac 
tcggcgccac 
tgcggcagga 
tcgacaaggc 
gcgaccacct 
ccgcgctgaa 


gggtgcgggg 
agtggcgatc 
cgcgctgcag 
ctgggtgggc 
cctggcgggc 
gatcaccctc 
cctggccgcc 
cggcggcagc 
cctcgccatc 
gccggtgacc 
ggcggacgtc 
cctcgacgtg 
cgacggcgcc 
cctgcacgag 
catcagcggc 
actgagccgg 
cctcggcgtc 
cggcctgcgc 
gatcacgccg 
ggtcaacgag 
cctgcgcgac 
gcaccccggc 
gggcgcggcg 


gcgcccccca 
tccgcggcca 
gccggaggcc 
gcctgggcca 
cgctccgtgc 
tcgaacctgt 
gggcccgaca 
gcccgggtga 
ccctacgggg 
taccatccgc 
accgccgtcg 
ctgagccacg 
cgctcgcaga 
gcggcgggcg 
aaccggctcc 
ttccagcggg 
aacgacgtcc 
accctcgtcg 
ttcggcggct 
gagatccgct 
ccgtacgacc 
gacaaggggt 
ccggtcaagg 


ccaagcaccg 
tatacgccgg 
gtctcccacg 
ccgcaccggc 
gcaacgtcgt 
acgggcagtc 
ccgccgccgc 
tcatcccggc 
cgaacgtcct 
aggcccggca 
cgtacaccac 
aggccgacgg 
gcgacgccaa 
acggccggga 
tgaccagcag 
acgtgctgga 
tgaacagccc 
accgggcgca 
acggcggcta 
ccggccgggt 
cgcgccggat 
acgcgcgcat 
cgtag 


tgccctgctc 
agcgtccgcg 
aggagacgcc 
cgcggccgag 
gcacacctcg 
gccgctgacc 
gatcgccgac 
gggcggccag 
ggtcaccacg 
gaccagctac 
ccccacgccc 
cacggtcgtg 
ccaccgctgg 
cacgccccgc 
gccggggcgg 
acgcaccaac 
ggaactcgcc 
cgcccgggga 
caccgaggcc 
cttcgacacg 
gcgctccgac 
gggcgcggtc 
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Figure 22 (SEQ ID No. 20) 

1 mtsmsrarva rriaagaayg gggigXagaa avglwaevq larrrvgvgt ptrvpnaqgl 
61 yggtlptagd pplrlmmlgd staagqgvhr agqtpgalla sglaavaerp vrlgsvagpg 
121 acsddldrq? alvlaepdrv pdicvxmvga ndvthnnpat rsvrblssav rrlrtagaev 
181 wgtcpdlgt iervrqpXrw larrasrqla aaqtigaveq ggrtvslgdl Igpefaqnpr 
241 elfgpdnyhp saegyataam avXpsvcaal glwpadeehp dalrregflp varaaaeaas 
301 eagtevaaam ptgprgpwal Ikrrrrrrvs eaepsspsgv 


Figure 23 (SEQ ED No. 21) 


1 atgacgagca tgtcgagggc gaggg'tggcg cggcggatcg cggccggcgc ggcgtacggc 
61 ggcggcggca tcggcctggc gggagcggcg gcggtcggtc tggtggtggc cgaggtgcag 
121 ctggccagac gcagggtggg ggtgggcacg ccgacccggg tgccgaacgc gcagggactg 
181 tacggcggca ccctgcccac ggccggcgac ccgccgctgc ggctgatgat gctgggcgac 
241 tccacggccg ccgggcaggg cgtgcaccgg gccgggcaga cgccgggcgc gctgctggcg 
301 tccgggctcg cggcggtggc ggagcggccg gtgcggctgg ggtcggtcgc ccagccgggg 
361 gcgtgctcgg acgacctgga ccggcaggtg gcgctggtgc tcgccgagcc ggaccgggtg 
421 cccgacatct gcgtgatcat ggtcggcgcc aacgacgtca cccaccggat gccggcgacc 
481 cgctcggtgc ggcacctgtc ctcggcggta cggcggctgc gcacggccgg tgcggaggtg 
541 gtggtcggca cctgtccgga cctgggcacg atcgagcggg tgcggcagcc gctgcgctgg 
601 ctggcccggc gggcctcacg gcagctcgcg gcggcacaga ccatcggcgc cgtcgagcag 
661 ggcgggcgca cggtgtcgct gggcgacctg ctgggtccgg agttcgcgca gaacccgcgg 
721 gagctcttcg gccccgacaa ctaccacccc tccgccgagg ggtacgccac ggccgcgatg 
781 gcggtactgc cctcggtgtg cgccgcgctc ggcctgtggc cggccgacga ggagcacccg 
841 gacgcgctgc gccgcgaggg cttcctgccg gtggcgcgcg cggcggcgga ggcggcgtcc 
901 gaggcgggta cggaggtcgc cgccgccatg cctacggggc ctcgggggcc ctgggcgctg 
961 ctgaagcgcc ggagacggcg tcgggtgtcg gaggcggaac cgtccagccc grtccggcgtt 
1021 tga 
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Figure 24 (SEQ ID No. 22) 


1 mgrgtdqrtr ygrrrarval aaltaavlgv gvagcdsvgg dspapsgsps krtrtapawd 

61 tspasvaavg dsitrgfdac avlsdcpevs watgssalcvd slavrllgka daaehswnya 

121 vtgannadlt aqvtraaqre pelvavroaga ndacrsttsa mtpvadfraq feeamatlrk 

181 klpkaqvyvs sipdlkrlws qgrtnplgkq vwklglcpsm Igdadsldsa atlrmtvrd 

241 rvadynevlr evcakdrrcr sddgavhefr fgtdqlshwd wflipsvdgqa rlaeiayrav 
301 taknp 


Figure 25 (SEQ ID No. 23) 

1 atgggtcgag ggacggacca gcggacgcgg tacggccgtc gccgggcgcg tgtcgcgctc 

61 gccgccctga ccgccgccgt cctgggcgtg ggcgtggcgg gctgcgactc cgtgggcggc 

121 gactcacccg ctccttccgg cagcccgtcg aagcggacga ggacggcgcc cgcctgggac 

181 accagcccgg cgtccgtcgc cgccgtgggc gactccatca cgcgcggctt cgacgcctgt 

241 gcggtgctgt cggactgccc ggaggtgtcg tgggcgaccg gcagcagcgc gaaggtcgac 

301 tcgctggccg tacggctgct ggggaaggcg gacgcggccg agcacagctg gaactacgcg 

361 gtcaccgggg cccggatggc ggacctgacc gctcaggtga cgcgggcggc gcagcgcgag 

421 ccggagctgg tggcggtgat ggccggggcg aacgacgcgt gccggtccac gacctcggcg 

481 atgacgccgg tggcggactt ccgggcgcag ttcgaggagg cgatggccac cctgcgcaag 

541 aagctcccca aggcgcaggt gtacgtgtcg agcatcccgg acctcaagcg gctctggtcc 

601 cagggccgca ccaacccgct gggcaagcag gtgtggaagc tcggcctgtg cccgtcgatg 

661 ctgggcgacg cggactccct ggactcggcg gcgacxrctgc ggcgcaacac ggtgcgcgac 

•721 cgggtggcgg actacaacga ggtgctgcgg gaggtctgcg cgaaggaccg gcggtgccgc 

781 agcgacgacg gcgcggtgca cgagttccgg ttcggcacgg accagttgag ccactgggac 

841 tggttccacc cgagtgtgga cggccaggcc cggctggcgg agatcgccta ccgcgcggtc 

901 accgcgaaga atccctga 
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1 mrlsrraata sallltpala Ifgasaavsa 
61 ckrstksypa Iwaashtgtr fnftacsgar 
121 adtmttcnlq gesaclaria karayiqqtl 
181 Iggscavgls eksraainaa addlnavtak 
241 vtlpvensyh ptangqslcgy Ipvlnsat 


priqatdyva Igdsyssgvg agsydsssgs 
tgdvlakqlt pvnsgtdlvs itiggndagf 
paqldqvyda idsrapaaqv wlgyprfyJc 
raadhgfafg dvnttfaghe Icsgapwlhs 


Figure 27 (SEQ ID No. 25) 


1 ttcatcacaa cgatgtcaca 
61 gacaagcctt cccgtgacga 
121 gggaggttcc atgagactgt 
181 ggcgctcgcg ctcttcggcg 
241 ctacgtggcc ctcggcgact 
301 cagtggctcc tgtaagcgca 
361 cggtacgcgg ttcaacttca 
421 gcagctgacc ccggtcaact 
481 cgcgggcttc gccgacacca 
541 gcggatcgcc aaggcgcgcg 
601 ctacgacgcc atcgacagcc 
661 cttctacaag ctgggcggca 
721 caacgccgcc gccgacgaca 
781 cgccttcggg gacgtcaaca 
841 gctgcacagc gtcacccttc 
901 caagggctac ctgcccgtcc 
961 cgaagtcccg cccccgggcg 
1021 gccggtggcc ccgccgtacg 


acaccggcca tccgggtcat 
aagggtcctg ctacatcaga 
cccgacgcgc ggccacggcg 
cgagcgccgc cgtgtccgcg 
cctactcctc gggggtcggc 
gcaccaagtc ctacccggcc 
ccgcctgttc gggcgcccgc 
ccggcaccga cctggtcagc 
tgaccacctg caacctccag 
cctacatcca gcagacgctg 
gggcccccgc agcccaggtc 
gctgcgccgt cggtctctcg 
tcaacgccgt caccgccaag 
cgaccttcgc cgggcacgag 
ccgtggagaa ctcctaccac 
tgaactccgc cacctgatct 
gggcttcgcc gtaggtgcgc 
tgccgccgcc cccggacgcg 


ccctgatcgt gggaatgggt 
aatgacagaa atcctgctca 
tccgcgctcc tcctcacccc 
ccgcgaatcc aggccaccga 
gcgggcagct acgacagcag 
ctgtgggccg cctcgcacac 
acaggagacg tgctggccaa 
attaccatcg gcggcaacga 
ggcgagagcg cgtgcctggc 
cccgcccagc tggaccaggt 
gtcgtcctgg gctacccgcg 
gagaagtccc gcgcggccat 
cgcgccgccg accacggctt 
ctgtgctccg gcgccccctg 
cccacggcca acggacagtc 
cgcggctact ccgcccctga 
gtaccgccgt cgcccgtcgc 
gtcggttc 
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Figure 28 (SEQ ID No. 26) 

1 MKKBiVCLLG LVALTVQaAD SRPAFSRIVM FGDSLSDTGK mSKMRGYLP 

51 SSPPYYEGRF SNGPVKLEOL TKOFPGLTIA NEAEGGATAV AYNKISHNPK 

101 yQVINNLDYE VTQFIQKDSF KPDDLVILW GAHDYIAYGW N^Sm 

151 mSDAANRM VLNGAKQILL ™[,PDLGQNP SftRSQKWEA 

201 LLLNIARQLA PTGMVKLFEI DKQFaEMLRD PQHE^S^ 

251 KPFATRSVSr DRQLSaFSPQ ERIAIAOJPI, LaQAVASEMa RRSASPLNCE 

301 GKMFWDQVHP TTWHaALSE RSATFIMIOY EFLRH* ««SASPUICE 
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Figure 29 (SEQ ID No. 27) 


1 ATGAAAAAAT GGTTTGTGTG TTTATTGGGA TTGGTCGCGC TGACAGTTCA 
TACTTTTTTA CCAAACACAC AAATAACCCT AACCAGCGCG ACTGTCAAGT 

51 GGCAGCCGAC AGTCGCCCCG CCTTTTCCCG GATCGTGATG TTCGGCGACA 
CCGTCGGCTG TCAGCGGGGC GGAAAAGGGC CTAGCACTAC AAGCCGCTGT 

101 GCCTCTCCGA TACCGGCAAA ATGTACAGCA AGATGCGCGG TTACCTCCCC 
CGGAGAGGCT ATGGCCGTTT TACATGTCGT TCTACGCGCC AATGGAGGGG 

151 TCCAGCCCGC CCTACTATGA GGGCCGTTTC TCCAACGGAC CCGTCTGGCT 
AGGTCGGGCG GGATGATACT CCCGGCAAM AGGTTGCCTG GGCAGACCGA 

201 GGAGCAGCT6 ACCAAACAGT TCCCGGGTCT GACCATC6CC AACGAAGCGG 
CCTCGTCGAC TGGrTTGTCA AGGGCCCAGft CTGGTAGCGG TTGCTTCGCC 

251 AAGGCGGTGC CACTGCCGTG GCTTACAACA AGATCTCCTG GAATCCCAAG 
TTCCGCCACG GTGACGGCAC CGAATGTTGT TCTAGAGGAC CTTAGGGTTC 

301 TATCAGGTCA TCAACAACCT GGACTACGAG GTCACCCAGT TCTTGCAGAA 
ATAGTCCAGT AGTTGTTGGA CCTGATGCTC CAGTGGGTCA AGAACGTCTT 

351 AGACAGCTTC AAGCCGGACG ATCTGGTGAT CCTCTGGGTC GGTGCCAATG 
TCTGTCGAAG TTCGGCCTGC TAGACCACTA GGAGACCCAG CCACGGTTAC 

401 ACTATCTGGC CTATGGCTGG AACACGGAGC AGGATGCC7A GCGGGTTCGC 
TGATAGACCG GATACCGACC TTGTGCCTCG TCCTACGGTT CGCCCAAGCG 

451 GATGCCATCA GCGATGCGGC CAACCGCATG GTACTGAACG GTGCCAAGCA 
CTACGGTAGT CGCTACGCCG GTTGGCGTAC CATGACTTGC CACGGTTCGT 

501 GATACTGCTG TTCAACCTGC CGGATCTGGG CCAGAACCCG TCAGCTCGCA 
CTATGACGAC AAGTTGGACG GCCTAGACCC GGTCTTGGGC AGTCGAGCGT 

551 GTCAGAAGGT GGTCGAGGCG GTCAGCCATG TCTCCGCCTA TCACAACCAG 
CAGTCTTCXa CCAGCTCCGC CAGTCGGTAC AGAGGCGGAT AGTGTTGGTC 

601 CTGCTGCTGA ACCTGGCACG CCAGCTGGCC CCCACCGGCA TGGTAAAGCT 
GACGACGACT TGGACCGTGC GGTCGACCGG GGGTGGCCGT ACCATTTCGA 

651 GTTCGAGATC GACAAGCAAT TTGCCGAGAT GCTGCGTGAT CCGCAGAACT 
CAAGCTCTAG CTGTTCGTTA AACGGCTCTA CGACGCACTA GGCGTCTTGA 

701 TCGGCCTGAG CGACGTCGAG AACCCCTGCT ACGACGGCGG CTATGTGTGG 
AGCCGGACTC GCTGCAGCTC TTGGGGACGA TGCTGCCGCC GATACACACC 

751 AAGCCGTTTG CCACCCGCAG CGTCAGCACC GACCGCCAGC TCTCCGCCTT 
TTCGGCAAAC GGTGGGCGTC GCAGTCGTGG CTGGCGGTCG AGAGGCGGAA 

801 CAGTCCGCAG GAACGCCTCG CCATCGCCGG CAACCCGCTG CTGGCACAGG 
GTCAGGCGTC CTTGCGGAGC GGTAGCGGCC GTTGGGCGAC GACCGTGTCC 

851 CCGTTGCCAG TCCTATGGCC CGCCGCAGCG CCAGCCCCCT CAACTGTGAG 
GGCAACGGTC AGGATACCGG GCGGCGTCGC. GGTCGGGGGA GTTGACACTC 

901 GGCAAGATGT TCTGGGATCA GGTACACCCG ACCACTGTCG TGCACGCAGC 
CCGTTCTACA AGACCCTAGT CCATGTGGGC TGGTGACAGC ACGTGCGTCG 

951 CCTGAGCGAG CGCGCCGCCA CCTTCATCGC GAACCAGTAC GAGTTCCTCG 
GGACTCGCTC GCGCGGCGGT GGAAGTAGCG CTTGGTCATG CTCAAGGAGC 

1001 CCCAC TGA 
GGGTG ACT 
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Figure 30 (SEQ ID No. 28) 


1 MKKWFVCLLG LIALTVQAAD TRPAFSRIVM FGDSLSDTGK MYSKMRGYLP 

51 SSPPYYEGRF SHGPVWLEQL TKQFPGLTIA NEAEGGATAV AYHKISWNPK 

101 YQVIKNLDYE VTQFLQKDSF KPDDLVILW7 GANDYIAYGW NTEQDAKRVR 

151 DAISDAANRM VLNGAKQILL FNLPDLGQKP SARSQKWEA VSHVSAYHNK 

201 LLLNLAROIA PTGMVKLFEI DKQFAEMLRD PQNEXSLSDVE NPCYDGGYVW 

251 ' KPFATRSVST DRQLSAFSPQ ERLAIAGNPL LAQAVASPMA RRSASPLNCE 

301 GKMFWDQVHP TTWHAALSE RAATFIETQY EFIAHG* 
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Figure31(SEQIDNo. 29) 

1 ATGAAAAAAT GGTTTGTTTG TTTATTGGGG TTGATCGCGC TGACA6TTCA 
TACTTTTTTA CCAAACAAAC AAATAACCCC AACTAGCGCG ACTGTCAAGT 

51 GGCAGCCGAC ACTCGCCCCG CCTTCTCCCG 6ATCGTGATG TTCGGCGACA 
CCGTCGGCTG TGAGCGGGGC GGAAGAGGGC CTAGCACTAC AAGCCGCTGT 

101 GCCTCTCCGA TACCGGCAAA AT6TACAGCA AGATGCGCGG TTACCTCCCC 
CGGAGAGGCT ATGGCCGTTT TACATGTCGT TCTACGCGCC AATGGAGGGG 

151 TCCAGCCCGC CCTACTATGA GGGCCGTTTC TCCAACGGAC CCGTCTGGCT 
AGGTCGGGCG GGATGATACT CCCGGCAAAG AGGTTGCCTG GGCAGACCGA 

201 GGAGCAGCTG ACCAAGCAGT TCCCGGGTCT GACCATCGCC AACGAAGCGG 
CCTCGTCGAC TGGTTCGTCA AGGGCCCAGA CTGGTAGCGG TTGCTTCGCC 

251 AAGGCGGTGC CACTGCCGTG GCTTACAACA AGATCTCCTG GAATCCCAAG 
TTCCGCCACG GTGACGGCAC CGAATGTTGT TCTAGAGGAC CTTAGGGTTC 

301 TATCAGGTCA TCAACAACCT GGACTACGAG GTCACCCAGT TCTTGCAGAA 
ATAGTCCAGT AGTTGTTGGA CCTGATGCTC CAGTGGGTCA AGAACGTCTT 

351 AGACAGCTTC AAGCCGGACG ATCTGGTGAT CCTCTGGGTC GGTGCCAATG 
TCTGTCGAAG TTCGGCCTGC TAGACCACTA GGAGACCCAG CCACGGTTAC 

401 ACrrATCTGGC ATATGGCTGG AATACGGAGC AGGATGCCAA GCGAGTTCGC 
TGATAGACCG TATACCGACC TTATGCCTCG TCCTACGGTT CGCTCAAGCG 

451 GATGCCATCA GCGATGCGGC CAACCGCATG GTACTGAACG GTGCCAAGCA 
CTACGGTAGT CGCTACGCCG GTTGGCGTAC CATGACTTGC CACGGTTCGT 

501 6ATACTGCTG TTCAACCTGC CGGATCTGGG CCAGAACCCG TCAGCCCGCA 
CTATGACGAC AAGTTGGACG GCCTAGACCC GGTCTTGGGC AGTCGGGCGT 

551 GTCAGAAGGT GGTCGAGGCG GTCAGCCATG TCTCCGCCTA TCACAACAAG 
CAGTCTTCCA CCAGCTCCGC CAGTCGGTAC AGAGGCGGAT AGTGTTGTTC 

601 CTGCTGCTGA ACCTGGCACG CCAGCTGGCC CCCACCGGCA TGGTAAAGCT 
GACGACGACT TGGACCGTGC GGTCGACCGG GGGTGGCCGT ACCATTTCGA 

651 GTTCGAGATC GACAAGCAAT TTGCCGAGAT GCTGCGTGAT CCGCAGAACT 
CAAGCTCTAG CTGTTCGTTA AACGGCTCTA CGACGCACTA GGCGTCTTGA 

701 TCGGCCTGAG CGACGTCGAG AACCCCTGCT ACGACGGCGG CTATGTGTGG 
AGCCGGACTC GCTGCAGCTC TTGGGGAC6A TGCTGCCGCC GATACACACC 

751 AAGCCGTTTG CCACCCGCAG CGTCAGCACC GACCGCCAGC TCTCCGCCTT 
TTCGGCA/iAC GGTGGGCGTC GCAGTCGTGG CTGGCGGTCG AGAGGCGGAA 

801 CAGTCCGCAG GAACGCCTC6 CCATCGCCGG CAACCCGCTG CTGGCACAGG 
GTCAGGCGTC CTTGCGGAGC GGTAGCGGCC GTTGGGCGAC GACCGTGTCC 

851 CCGTTGCCAG TCCTATGGCC CGCCGCAGCG CCAGCCCCCT CAACTGTGAG 
GGCAACGGTC AGGATACCGG GCGGCGTCGC GGTCGGGGGA GTTGACACTC 

901 GGCAAGATGT TCTGGGATCA GGTACACCCG ACCACTGTCG TGCACGCAGC 
CCGTTCTACA AGACCCTAGT CCATGTGGGC TG6TGACAGC ACGTGCGTCG 

951 CCTGAGCGAG CGCGCCGCCA CCTTCATCGA GACCCAGTAC GAGTTCCTCG 
GGACTCGCTC GCGCGGCGGT GGAAGTAGCT CTGGGTCATG CTCAAGGAGC 

1001 CCCACGGATG A 
GGGTGCCTAC T 
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Figure 32 

1 10 20 30 40 50 
• »■ *~. + 1. 1 

r.^^ „ ™TraWSRIVHFGDSLSDTGKttYSmRGyLPSSPPYYEGRFSH--G 

R.sol QSGHPHVflKVQRHWFGDSLSDIGT YTPVflQflVGGGKFTTHPG 

Consensus . . .adnraafqRiVnFGDSLSDiGk YlPsaqaageGrFsn. .G 


51 BO 70 80 90 


100 


t.^^^^ PVflEQLTKQFPGLTIRHEHEGGflTflVflYMiaSHHPKYQVIHHLDyEVTQ 
R.sol PIHflETVflflQL-GVTLTPflVMGYflTSVQNCPKRGCFDYRQGGSRVTDPHG 
Consensus P ! KaEqlaaQl.GlTianaaeGgflTaVannki^nf dYaqgnnrdt #pnq 

101 110 120 130 140 150 
I ^ ^ ^ 

satH FLQiaiSFKPDDLVIUWGRHDYLB¥G--HHTEQDRKRVRDfiISDHRKRHV 
R.S01 IGHHGGfiGflLTYPVQQQLflKFYflHSfmTFKGHHDWFVLBGSHDIFFHTT 
Consensus igqndgagaddlp ! qqqgflHdYaflsn. .flfgSfflakrVraainDaanrnt 

151 IBO 170 180 190 200 
I ^ ^ ^ ^ 

sate LHGflKQILLFNLPDLGQKPSeRSQKWEflVSHVSflYHKKL-LLKLnRQLH 
R.sol flHflTSGSGVTPftlflTRQVQQflflTDLVGYVKDMIflKGflTQVYVFHLPDSSL 
Consensus aaaakqiglf naialaQnqqRasSlVgeakdh ! aaganql.llKLarqla 

201 210 220 230 240 250 
I , ^ ^ 

satB PTGMVKLFEIDKQFHEMLRDPQHFGLSDVENPCYDGGYVHKPFnTRSVST 
R.sol TPDGVflSGTTGQBLLHflLVGTFHTTLQSGLflGTSHRIIDFHflQLTRBIQN 
Consensus PpdgValgeidqalaeaLrdpqHf gLqdgeagcsargidf naqaTaa !qn 

251 2B0 270 280 290 300 
I 1 ^ ^ ^ _ 

satfl DRQLSRFSPQERLRIfiG—HPLLBQRVRSPM RRRSflSPLNCEGKKFH 

R.sol GHSFGFflHTSRRRCDfiTKIHflLVPSRGGSSLFCSRHTLVflSGRDQSYLFfl 
Consensus daqlgaanpqaRaad{fe..llaUaqHgaSpS...Rrrlaapgad8gk^a 


301 310 320 330 


satfl DQVHPTTWHflflLSERRBTFIETQYEFLRH 
R.sol DGVHPnflGHRURSHVLRRLLR— DHVRH 
Consensus DqVHPTTagHaaiaepaaapiea. , #nlHH 
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Figure 33 

T 

Pfam *->ivafGDSltdggg ayygdsdgggwgagladrltsla. .rlrargrgvdv 

Sriml 3B ' YVALGDSYSSGVG agSYDSSSGSCKRSTKS YPALWAAS . . HTGTRF 81 

Scoel 5 YVAVGDSFTEG— — VGDPGPDGAFVGWADRLAVLL. .ADRRPEGDFTY 47 

Scoe2 10 LVAVGDSFTEG— — MSDLLPDGSYRGWADLLATRM. . — AARSPGFRY 50 

Scoe3 239 WAFGDSITDG— AI^QSDZmHIOTDVLAAKLHEAA. .GDGRDTPRYSV 283 

Scoe4 75 LMMLGDSTAAG— QGVHRAGCJTPGAUUASG. .LAAVAERPVRL 113 

ScoeS 66 VAAVGDSITRGFD. acAVLSDCPEVSVJATGSSAKVDSLAvrLLGKAEAAEHS 116 

Ahydl 28 IVMFGDSLSDTGKmyskmrgylpssppyYEGI^NGPVMLEQLTNEFPGLTiaNEAE^ 91 

Asall 28 IVMFGDSLSDTGKmyskmrgylpssppyYEGRFSNGPVWLEQLTKQF PGLTI 79 

Ahyd2 40 IVMFGDSI^SDTG KinvB fanrqylps sppvYEGRFSNGPVWLEQLTKOFP6LTia?JEAEGt^ Ta va 103 

Pfam fnrglsGrtsdGrlwDarlvatllFlaqflGlulpPYLsgdflrGANFAsagAtllgtslipflni 

Sriml 82 NFTACSGAR 90 

Scoel 48 TNLAVRGRL 56 

Scoe2 51 ANLAVRGKL 59 

Scoe3 284 VNEGISGNR 292 

Scoe4 114 GSVAQPGAC 122 

ScoeS 117 WNYAVTGAR • 125 

Ahydl 92 YNKISVINPK 100 

Asall 80 ANEMGGAT 88 

Ahyd2 104 YNKISWNPK 112 

T 

Pfam QvqFkdf kskvlelrqa IgllqellrlvpvldakspdlvtimiGtNDl . . . itvakf gpks 

Srina 91 TGDVLAKQLTPVNSGTDLVSITIGGNDAgf aDTMTTCNLQG 131 

Scoel 57 --IJX3IVAEQVPRWGLAPDLVSFAAGGNDI . . . 1 86 

Scoe2 60 --IGQIVDEQVDVAAAMGADVITLVGGLNDT... 88 

Scoe3 293 LLTSRPGRPA DNPSGLSRFQRDVLERTNVKAWWLGVNDV . . . 333 

Scoe4 123 SDDLDRQVALVIAEPDRVPDICVIMVGANDV. . . 153 

Scoe 5 126 MADLTAQVTRAAQREPELVAVMAGANDA ... CR 155 

Ahydl 101 YQVI HKLDYEVTQFLQKDSFKPDDLVILWVGANDY... LA 137 

Asall 89 AVAYNKISWNpkyqvyNNLDYEVTQFLQKDSFKPDDLVILWVGANDY... LA 137 

Ahyd2 113 YQVI NNLDYEVTQFLQKDSFKPDDLVILWGANDY> . . LA 149 


Pfam tksdmvsvpef rdnlrkllkrLrsangariiilitlvllnlpl plGCl 

Sriml 132 esaclarlAKARAYIQQTLPAQLDQVYDAIDSRAPAA QWVLGYP- 176 

Scoel 87 RPGTDPDEVAERFELAVAALT-AAAGTVLVTTGFDTRGVP- 125 

Scoe2 89 LRPKCDMARVRDLLTQAVERLAPHCEQLVLbaRSP- 122 

Scoe3 334 LNSPELADRDAILTGLRTLVDRAHARGLRWGATITPFGGYGG- 376 

Scoe4 154 THRMPATRSVRHLSSAVRRLR-TAGAEVWGTCPDLGTIE- 192 

Scoe5 156 STTSAMTPVADFRAQFEEAMATLR-KKLPKAQVYVSSIPDLKRLwsqgrtnplgkQVWKL 214 

Ahydl 138 YGWNTEQDAKRVRDAISDAANRMV-LNGAK EILLFNLP- 174 

Asall 138 YGlTOTEQCtAKRVRDAISDAANRMV-LNGAK QILLFNLP- 17 4 

Ahyd2 150 YGWNTEQDAKRVRDAISDAANRMV-LNGAK QILLFNLP- 186 


Pfam pq- klalalassknvdatgclerlneavadynealrelaei . ek. 1 . q. aqlrkdglpdlkeanvpy 

Sriml 177 —. RFYKLGGSCAVGLSEKSRAAINAAADDINAVTAKRA— . ADHGFAF 219 

Scoel 126 — . VLKHLRGKIATYNGHVRAIA — . — DRYGCPV 152 

Scoe2 123 — . GRQGPVLERFRPRMEALFAVIDDLA-- . — . GRHGAW 154 

Scoe3 377 — . YTEARETMRQEVNEErRSGRVFDTWDFDKALRDPY — 412 

Scoe4 193 — . RVRQPLRWLaRRaSrQlAAAQTIGAVEQGGRTVSL 227 

Scoe5 215 GLcPSMLGDADSLDSAATLRRNTVRDRVADYNEVLREVC — . — --.JUtDRRCRSDDGAVHEFRFGT 273 

Ahydl 175 — . DLGC2NPSARSQKWEAASHVSAYHNQLLLNLA — . ~ . - . - . RQLAPTQSVKLFEI DKQF 224 

Asall 175 ~, DLGQNPSARSQKVVEAVSHVSAyHNiaLI2nA--.--.-.-.RQLAPTGI<VKLFErDKQF 224 

Ahyd2 187 --. DLGC3JJPSARSQKVVEAVSHVSAYH»QiajmA--.--.-.-.RQLAPTGtlVKLrEIDKQF 236 


Pfam VDlysi f qdldgiqnpsayv. y GFeet . kaCCGyGgr . yNyn . rv. CGnag . 1 . ck . vt akaC 

Sriml 220 GDVNT . .-TFAgHElCSGZVPwL.HS.VT 242 

Scoel 153 LDLWSLRSVQDRRA . . , — . 166 

Scoe2 155 VDLYGAQSIADPRM .- . . . — . — \ 150 
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Scoe3 413 . -~- — * 413 

Scoe4 228 GDUXSPEFAONPREL . . . — — 242 

ScoeS 274 DQL . . — . 276 

Ahydl 225 AEtaRDPQNFGLSDQRNACYgGsyvwKPFASrSASTDSQLSaFNPQeRLalAGNPlLaQAvASPMAA 291 
Asall 225 AEMLRDPQNFGLSDVEaJPCYdGgyvwKPFATrSVSTDRQLSaFSPQeRLalAGNPlLaQRvASPMAR 291 
Ahyd2 237 AEMIJa)PQNFGLSDVEHPCYdGgyvwKPFATrSVSTDRQI.SaFSPQeRLaIAGNPlLaQavASPMA^ 303 

T 

Pf am . dassyll . atlfwDgf , HpsekGy kavAeaK-* 

Sriml 243 . . --LPVENSyHPTANGQSKGYLPV 263 

Scoel 167 . .--WDADRL.HI^PEGHTRVALRA 186 

Scoe2 169 . WDVDRL.HLTAEGHRRVAEAV 188 

Scoe3 413 .-DPRBMRsDYDSGDHL.HPGDKGYARMGAVI 441 

Scoe4 243 . . — FGPDNY.HPSAEGyATAAMAV 262 

ScoeS 277 . . — SHWDWF.HPSVDGQARIAEIA 296 

Ahydl 292 rSASTLNCeGKMFOTXJV.HPTTWHAALSEPA 322 

Asall 292 rSASPLNCeGKMFHDQV.HPTTWHAALSERA 322 

Ahyd2 304 rSASPLNCeGK MFHDQV . HPT TWHAALSERA 334 
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Figure 34 


T 

Pf am *->ivaf GDSltdggg ayygdsdgggwgagladrltsla . . rlrargrgvdv 

Sriml 38 YVALGDSYSSGVG agSYDSSSGSCKRSTKSYPALRAAS. . HT6TRF 81 

Scoel 5 YVAVGDSFTEG-- • — VGDPGPDGAFVGWADRLAVLL. .ADRRPEGDFTY 47 

Scoe2 10 LVAVGDSFTEG— --MSDLLPDGSYRGWADLLATRM . . — AARSPGFRY 50 

Ahydl 28 I\WrcDSLSDTGKmyskmrgylpssppyYEGRrSNGPVl*LEQLTNEFPGLTiaNEAEGGPTAVA 91 

Asall 28 IVMFGDSLSDTGKmystanrgylpssppyYEGRFSNGPVMLEQLTKQF PGLTI 79 

Ahyd2 40 IWFGDSLSDTGKmystonrgylpssppyY 103 

Pfam fnrgisGrtsdGrlwDarlvatllFlaqflGlnlpPYLsgdflrGAHFTVsagAtllgtslipflni 

Sriml 82 NFTACSGAR 90 

Scoel 48 TKLAVRGKL * 56 

Scoe2 51 ANLAVRGKL 59 

Ahydl- 92 YNKISWNPK 100 

Asall 80 ANEAEGGAT 88 

Ahyd2 104 YNKISWNPK 112 

T 

Pfam QvqFkdf kskvlelrqa IgllqellrlvpvldakspdlvtimiGtNDl — itvakf gpks 

Sriml 91 TGDVIAKQLTPVNSGTDLVSITIGGNDAgfaDTKTTCNLQG 131 

Scoel 57 — LDQIVAEQVPRWGLAPDLVSFAAGGNDI... 1 86 

Scoe2 60 —IGQIVDEQVDVAAAMGTIDVITLVGGLNDT.. . 88 

Ahydl 101 YQVI NNLDYEVTQFLQKDSFKPDDLVILWVGANDY... LA 137 

Asall 89 AVAYNKXSWNpkyqvyNNLDYEVTQFLQKDSFKPDDLVILWVGANDY... LA 137 

Ahyd2 113 YQVI NNLDYEVTQFLQKDS FKPPD LVILWV6ANDY . . . LA 149 


pfam tksdrnvsvpef rdnlrklikrLrsangaridLilitlvllnlplplGCl 

Sriml 132 esaclarlAKARAYIQQTLPAQLDQVYDAIDSRAPAA QWVLGYP 176 

Scoel 87 RPGTDPDEVAERFELAVAALT-AAAGTVLVTTGFDTRGVP 125- 

Scoe2 89 LRPKCDMARVRDLLTQAVERLAPHCEQLVIJ5RSP 122 

Ahydl 138 YGWNTEQDAKRVRDAISDAANFMV-LNGAK EILLFNLP 174 

Asall 138 YGWNTEQDAKRVRDAISDAANRM7-LNGAK QILLFNLP 174 

Ahyd2 150 YGWNTEQDAKRVRDAISDAANRMV-LNGAK QILLFNLP 186 


Pfam pqklalalasstnvdatgclerlneavadynealrelaeieklqaqlrkdglpdlkeanvpy 

Sriml. 177 — RFYKLGGSCAVGLSEKSRAAINAAADDINAVTAKRA ADHGFAF 219 

Scoel 126 VLKHLRGKIATYNGHVRAIA DRYGCPV 152 

Scoe2 123 GRQGPVLERFRPRMEALFAVIDDLA GRHGAW 154 

Ahydl 175 DLGQNPSARSQKWEAASHVSAYHNQLLLNLA RQLAPTGMVKLFEIDKQF 224 

Asall 175 DLGQNPSARSQKWEAVSHVSAYHNKLLLNLA RQLAPTGMVKLFEIDKQF 224 

Ahyd2 187 DLGQNPSARSQKWEAVSHVSAYHNQLLLHLA RQLAPTGMVKLFEIDKQF 236 


Pfam VDlysif qdldgiqnpsayv. y GFeet . kaCCGyGgr - yNyn . rv. CGnag . 1 . ck .vtakaC 

Sriml 220 GDVNT . .-TFAgHElCSGAPwL.HS .VT 242 

Scoel 153 LDLWSLRSVQDRRA . . . — . 166 

Scoe2 155 VDLYGAQSIADPRM ,~ . . — . — . 168 

Ahydl 225 AEMLRDPQNFGLSDQRNACYgGsyvwKPFASrSASTDSQLSaFNPQeRLalAGNPlLaQAvASPMAA 291 

Asall 225 AEMLRDPQNFGLSDVENPCYdGgyvwKPFATrSVSTDRQLSaFSPQeRLalAGNPlLaQAvASPMAR 291 

Ahyd2 237 AEMLRDPQNFGLSDVENPCYdGgyvwKPFATrSVSTDRQLSaFSPQeRLalAGNPlLaQAvASPMAR 303 

T 

Pfam .dassyll.atlfwDgf .HpsekGykavAeaK-* 

Sriml 243 . .— LPVEHSyHPTANGQSKGYLPV 263 

Scoel 167 , .— WDADRL.HLSPEGHTRVTILRA 186 

Scoe2 169 . WDVDRL.HLTAEGKRRVAEAV 188 

Ahydl 292 rSASTLNCeGKMFWDQV.HPTTWHAALSEPA 322 

Asall 292 rSTlSPLNCeGKMFWDQV.HPTTWHAALSERA 322 

Ahyd2 304 rSASPLNCeGro fFWDQV , HPTT WHAALSERA 334 
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Figure 35 


(SEQIDNo. 30) 

1 MFKFKKHFLV GLSAALMSIS LFSATASAA S ADSRPAFSRI VMFGDSLSDT 

51 GKMYSKMRGY LPSSPPYYEG RFSNGPVWLE QLTKQFPGLT lAHEAEGGAT 

101 AVAYNKISWN PKYQVINNLD YEVTQFLQKD SFKPDDLVIL WGANDYLAY 

151 GWNTEQDAKR VRDAISDAAN ItMVLNGAKQI LLFNLPDLGQ NPSARSQKW 

201 EAVSHVSAYH NQLLLNLARQ LAPTGMVKLF EIDKQFaiML RDPC^FGLSD 

251 VENPCYDGGY VWKPFATRSV STDRQLSAFS PQERLAIAGN PLLAQAVASP 

301 MARRSASPLN CEGKMFWDQV HPTTVVHAAL SERAATFIAH QYEFIAH** 
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Figure 36 (SEQIDNaSl) 

I ATGTTTAAGT TTAAAAAG3UV TTTCTTAGTT GGATTATCGG CAGCTTTAAT 
TACAAATTCA AATTTTTCTT AAAGAATCAA CCTAATAGCC GTCGAAATTA 

51 GAGTATTAGC TTGTTTTCGG CAACCGCCTC TGCAGCTAGC GCCGACAGCC 
CTCATAATCG AACAAAAGCC GTTGGCGGAG ACGTCGATCG CGGCTGTCGG 

101 GTCCCGCCTT TTCCCGGATC GTGATGTTCG GCGACAGCCT CTCCGATACC 
CAGGGCGGAA AAGGGCCTAG CACTACAAGC CGCTGTCGGA GAGGCTATGG 

151 GGCAAAATGT ACAGCAAGAT GCGCGGTTAC CTCCCCTCCA GCCCGCCCTA 
CCGTTTTACA TGTCGTTCTA CGCGCCAATG 6AGGGGAGGT CGGGCGGGAT 

201 CTATGAGGGC CGTTTCTCCA ACGGACCCGT CTGGCTGGAG CAGCTGACCA 
GATACTCCCG GCAAAGAGGT TGCCTGGGCA GACCGACCTC GTCGACTGGT 

251 AACAGTTCCC GGGTCTGACC ATCGCCAACG AAGCGGAAGG CGGTGCCACT 
TTGTCAAGGG CCCAGACTOS TAGCGGTTGC TTCGCCTTCC GCCACGGTGA 

301 GCCGTGGCTT ACAACAAGAT CTCCTGGAAT CCCAAGTATC AGGTCATCAA 
CGGCACCGAA TGTTGTTCTA GAGGACCTTA GGGTTCATAG TCCAGTAGTT 

351 CAACCTGGAC TACGAGGTCA CCCAGTTCTT GCAGAAAGAC AGCTTCAAGC 
GTTGGACCTG ATGCTCCAGT GGGTCAAGTiA CGTCTTTCTG TCGAAGTTCG 

401 CGGACGATCT GGTGATCCTC TGGGTCGGTG CCAATGACTA TCTGGCCTAT 
GCCTGCTAGA CCACTAGGAG ACCCAGCCAC GGTTACTGAT AGACCGGATA 

451 GGCTGGAACA CGGAGCAGGA TGCCAAGCGG GTTCGCGATG CCATCAGCGA 
CCGACCTTGT GCCTCGTCCT ACGGTTCGCC CAAGCGCTAC GGTAGTCGCT 

501 TGCGGCCAAC CGCATGGTAC TGAACGGTGC CAAGCAGATA CTGCTGTTCA 
ACGCCGGTTG GCGTACCATG ACTTGCCACG GTTCGTCTAT GACGACAAGT 

551 ACCTGCCGGA TCTGGGCCAG AACCCGTCAG CTCGCAGTCA GAAGGTGGTC 
TGGACGGCCT AGACCCGGTC TTGGGCAGTC GAGCGTCAGT CTTCCACCAG 

501 GAGGCGGTCA GCCATGTCTC CGCCTATCAC AACCAGCTGC TGCTGAACCT 
CTCCGCCAGT CGGTACAGAG GCGGATAGTG TTGGTCGACG ACGACTTGGA 

651 GGCACGCCAG CTGGCCCCCA CCGGCATGGT AAAGCTGTTC GAGATCGACA 
CCGTGCGGTC GACCGGGGGT GGCCGTACCA TTTCGACTUiG CTCTAGCTGT 

701 AGCAATTTGC CGAGATGCTG CGTGATCCGC AGAACTTCGG CCTGAGCGAC 
TCGTTAAACG GCTCTACGAC GCACTAGGCG TCTTGAAGCC GGACTCGCTG 

751 GTCGAGAACC CCTGCTACGA CGGCGGCTAT GTGTGGAAGC CGTTTGCCAC 
CAGCTCTTGG GGACGATGCT GCCGCCGATA CACACCTTCG GCAAACGGTG 

801 CCGCAGCGTC AGCACCGACC GCCAGCTCTC CGCCTTCAGT CCGCAGGAAC 
GGCGTCGCAG TCGTGGCTGG CGGTCGAGAG GCGGAAGTCA GGCGTCCTTG 

851 GCCTCGCCAT CGCCGGCAAC CCGCTGCTGG CACAGGCCGT TGCCAGTCCT 
CGGAGCGGTA GCGGCCGTTG GGCGACGACC GTGTCCGGCA ACGGTCAGGA 

901 ATGGCCCGCC GCAGCGCCAG CCCCCTCAAC TGTGAGGGCA AGATGTTCTG 
TACCGGGCGG CGTCGCGGTC GGGGGAGTTG ACACTCCCGT TCTACAAGAC 

951 GGATCAGGTA CACCCGACCA CTGTCGTGCA CGCAGCCCTG AGCGAGCGCG 
CCTAGTCCAT GTGGGCTGGT GACAGCACGT GCGTCGGGAC TCGCTCGCGC 

IpOl CCGCCACCTT CATCGCGAAC CAGTAC6AGT TCCTCGCCCA CTGATGA 
GGCGGTGGAA GTAGCGCTTG GTCATGCTCA AGGAGCGGGT GACTACT 


wo 2005/066347 
Figure 37 
SEQIDNO. 32: 

ACAGGCCGATGCACGGAACCGTACCTTTCCGCAGTGAAGCGCTCTCCCCCCATCGTTCGC 

CGGGACrrCATCCGCGATTTTGGCATGAACACTTCCTTCAACGCGCGTAGCTTGCTACAA 

GTGCGGCAGCAGACCCGCTCGTTGGAGGCTCAGTGAGATTGACCCGATCCCTGTCGGCCG 

CATCCGTCATCGTCTTCGCCCTGCTGCTCGCGCTGCTGGGCATCAGCCCGGCCCAGGCAG 

CCGGCCCGGCCTATGTGGCCGTGGGGGATTCCTATTCCTCGGGCAACGGCGCCGGAAGTT 

ACATCGATTGGAGCGGTGACTGTCACCGCAGCAACAACGCGTACCCCGCCCGGTGGGGGG 

CGGCCAACGCACCGTCCTCCTTCACCTTCGCGGCCTGCTCGGGAGCGGTGACCACGGATG 

TGATCAACAATCAGCTGGGCGCCCTCAACGGGTCCACCGGCCTGGTGAGCATCAGCATCG 

GCGGCAATGACGCGGGCTTCGCGGACGCGATGACCACCTGCGTCACCAGCTCGGACAGCA 

CCTGCCTCAACCGGCTGGCCACCGCCACCAACTACATCAACACCACGCTGCTCGCCCGGC 

TCGACGCGGTCTACAGCCAGATCAAGGCCGGTGCCCCCAACGCCCGCGTGGTCGTCCTCG 

GCTACCCGGGCATGTACCTGGCCTGGAACCCGTGGTACTGCCTGGGGCTGAGCAACACCA 

AGCGCGCGGCCATCAACACGAGGGCCGAGACCGTGAAGTCGGTGATCTCCTCCGGGGCCA 

CCGCCCAGGGATTCCGATTCGGCGATGTCCGCCGGAGCTTCAACAACCAGGAAGTGTTCT 

TCGGCAACGACTGGCTGCACTCAGTCACCCTGCGGGTGTGGGAGTCGTACCAGCGCACCA 

GCACGGGCCATCAGAGCGGCTATCTGCCGGTCCTCAAGGCCAACAGCTCGACCTGATCAA 

CGCACGGCCGTGCCCGCGCCGCGCGTCACGCTCGGCGCGGGCGCCGCAGCGCGTTGATCA 

GCCCACAGTGCCGGTGACGGTGCCACCGTCACGGTCGAGGGTGTACGTCACGGTGGCGCG 

GCTCCAGAAGTGGAACGTCAGCAGGACCGTGGAGCCGTCCCTGACCTCGTCGAAGAACTC 

CGGGGTCAGCGTGATCACCCGTCCCCCGTAGCCGGGGGCGAAGGCGGCGCCGAACTCCTT 

GTAGGACGTCCAGTCGTGCGGCCCGGCGTTGCCACCGTCCGCGTAGACCGCTrCGATGGT 

CGCCAGCCGGTCCCCGCGGAACTCGGTGGGGATGTGCGTGCCCAAGGTGGTCCGGGTGGT 

GTCCGAGAGCACCGGGGGCTCGTACCGGATGATGTGCAGATCCAAAGAATT 

FIGURE 38 
SEQ ID NO. 33: 

MRLTRSLSAASVIVFALLLALLGISPAQAAGPAYVALGDSYSSGNGAGSYIDSSGDCHRSN 

MAYPARWAAANAPSSFTFAACSGAVniDVlNNQLGAU^STGLVSmGGNDAGFAD^ 

CWSSDSTCLNRl^TAT^m^f^TLL^RLDAVYSQ 

LGLSNTKRAA!NTTADTI>JSVISSRATAHGFRFGDVRPTFNNHEIPFGNDV\^ 
SYHPTSTGHQSGYIPVLNANSST 
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Figure 39 

SEQIDNo.34 
ZP 00058717 

1 mlphpagerg evgaffallv gtpqdnir! echetrplrg rcgcgerrvp pftlpgdgvl 
61 cttsstrdae tvAvrkhlqpr pdggfrphlg vgdiagqgs pgvlwcgreg crfevcrrdt 
121 pglsrtmgd ssppfragws Ippkcgeisq sarktpavpr ysllrtdrpd gprgrfvgsg 
181 praatmtf IgipatvM aftlvlavpt gretlwrmwc eatqdwclgv pvdsrgqpae 
241 dgeflllspv qaatwgnyya Igdsyssgdg ardyypgtav kggcwrsana ypefvaeayd 
301 faghlsflac sgqrgyamid aidevgsqld wnsphtslvt igiggndlgf stvlktcmvr 
361 vplldskact dqedairkrm akfettfeel tsevrtrapd arilwgypr ifpeeptgay 
421 ytltasnqrw Inettqefhq qiaeavavhd eeiaasggvg svefvdvyha Idgheigsde 
481 pwvngvqird latgvtvdrs tfhpnaaghr avgervieqi etgpgrplya tfewagatv 
541 dtlagevg 
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nGURE40 
SEQDDNo. 35 

1 ggtggtgaac cagaacaccc ggtcgtcggc gtgggcgtcc aggtgcaggt gcaggttctt 
61 caactgctcc agcaggatgc cgccgtggcc gtgcacgatg gccttgggca ggcctgtggt 
121 ccccgacgag tacagcaccc atagcggatg gtcgaacggc agcggggtga actccagttc 
181 cgcgccttcg cccgcggcttcgaactccgc ccaggacagg gtgtcggcga cagggccgca 
241 gcccaggtac ggcaggacga cggtgtgctg caggctgggc atgccgtcgc gcagggcttt 
301 gagcacgtca cggcggtcga agtccttacc gccgtagcgg tagccgtcca cggccagcag 
361 cactttcggt tcgatctgcg cgaaccggtc gaggacgctg cgcaccccga agtcggggga 
421 acaggacgac cagglcgcac cgatcgcggc gcaggcgagg aatgcggccg tcgcctcggc 
4ai galgttcggc aggtaggcca cgacccggtc gccggggccx: accccgaggc tgcggagggc 
541 cgcagcgatc gcggcggtgc gggtccgcag ttctccccag gtccactcgg tcaacggccg 
601 gagttcggac gcgtgccgga tcgccacggc tgatgggtca cggtcgcgga agatgtgctc 
661 ggcgtagttg agggtggcgc cggggaacca gacggcgccg ggcatggcgt cggaggcgag 
721 caclgtggtg tacggggtgg cggcgcgcac ccggtagtac tcccagatcg cggaccagaa 
781 tccttcgagg tcggttaccg accagcgcca cagtgcctcg tagtccggtg cgtccacacc 
841 gcggtgctcc cgcacccagc gggtgaacgc ggtgaggttg gcgcgttctt tgcgctcctc 
901 gtcgggactc cacaggatcg gcggctgcgg cttgagtgte atgaaacgcg accccttcgt 
961 ggacggtgcg gatgcggtga gcgtcgggtg cctcccctaa cgctccccgg tgacggagtg 
1021 ttgtgcacca catctagcac gcgggacgcg gaaaccgtat ggagaaaaca cctacaaccc 
1081 cggccggacg gtgggtttcg gccacactta ggggtcgggt gcctgcttgc cgggcagggc 
1141 aglcccgggg tgctgtggtg cgggcgggag ggctgtcgct tcgaggtgtg ccggcgggac 
1201 actccgggcc tcagccgtac ccgcaacggg gacagttctc ctcccttccg ggctggatgg 
1261 tcccttcccc cgaaatgcgg cgagatctcc cagtcagccc ggaaaacacc cgctgtgccc 
1321 aggtactctt tgcttcgaac agacaggccg gacggtccac gggggaggtt tgtgggcagc 
1381 ggacx^acgtg cggcgaccag acgacggttg ttcctcggta tccccgctct tgtacttgtg 
1441 acagcgctca cgctggtctt ggctgtcccg acggggcgcg agacgctgtg gcgcatgtgg 
1501 tgtgaggcca cxxaggactg gtgcctgggg gtgccggtcg actcccgcgg acagcctgcg 
1561 gaggacggcg agtttctgct gctttctccg gtccaggcag cgacctgggg gaactattac 
1621 gcgctcgggg attcgtactc ttcgggggac ggggcccgcg actactatcc cggcaccgcg 
1681 gtgaagggcg gttgctggcg gtccgctaac gcxtetccgg agctggtcgc cgaagcctac 
1741 gactlcgccg gacacttgtc gltcctggcc tgcagcggcc agcgcggcta cgccatgctt 
1801 gacgctatcg acgaggtcgg ctcgcagctg gactggaact cccctcacac gtcgctggtg 
1861 acgatcggga tcggcggcaa cgatctgggg ttctccacgg ttttgaagac ctgcatggtg 
1921 cgggtgccgc tgctggacag caaggcgtgc acggaccagg aggacgctat ccgcaagcgg 
1981 atggcgaaat tcgagacgac gtttgaagag ctcatcagcg aagtgcgcac ccgcgcgccg 
2041 gacgcccgga tccttgtcgt gggctacccc cggatttttc cggaggaacc gaccggcgcc 
2101 tactacacgc tgaccgcgag caaccagcgg tggctcaacg aaaccattca ggagttcaac 
2161 cagcagctcg ccgaggctgt cgcggtccac gacgaggaga ttgccgcgtc gggcggggtg 
2221 ggcagcgtgg agttcgtgga cgtctaccac gcgttggacg gccacgagat cggctcggac 
2281 gagccgtggg tgaacggggt gcagttgcgg gacdcgcca ccggggtgac tgtggaccgc 
2341 agtaccttcc accccaacgc cgctgggcac cgggcggtcg glgagcgggt catcgagcag 
2401 atcgaaaccg gcccgggccg tccgctctat gcx^ctttcg cggtggtggc gggggcgacc 
2461 gtggacactc tcgcgggcga ggtggggtga cccggcttac cgtccggccc gcaggtctgc 
2521 gagcactgcg gcgatctggt ccactgccca gtgcagttcg tcttcggtga tgaccagcgg 
2581 cggggagagc cggatcgttg agccgtgcgf gtctttgacg agcacacccc gctgcaggag 
2641 ccgttcgcac agttctcttc cggtggccag agtcgggtcg acgtcgatcc cagcccacag 
2701 gccgatgctg cgggccgcga ccacgccgtt gccgaccagt tggtcgaggc gggcgcgcag 
2761 cacgggggcg agggcgcgga catggtccag gtaagggccg tcgcggacga ggctcaccac 
2821 ggcagtgcjcg accgcgcagg cgagggcgtt gccgccgaag gtgctgccgt gctggccggg 
2881 gcggatcacg tcgaagacft ccgcgtcgcc taccgccgcx; gccacgggca ggatgccgcc 
2941 gcccagcgct ttgcx^gaaca ggtagatatc ggcgtcgact ccgctgtggt cgcaggcccg 
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nGURE41 

SEQIDNo.36 

1 vgsgpraatr niflgipal vIvtaltM avptgrethv rmwceatqdw cigvpvdsrg 
61 qpaedgefll Ispvqaatwg nyyalgdsys sgdgardyyp gtavkggcwr sanaypelva 
121 eaydfaghls flacsgqrgy amidaidevg sqidwnspht sivtigiggn digfstvlkt 
181 cmvrvpllds kactdqedai rkrmakfett feelisevrt rapdarilw gyprifpeep 
241 Igayytltas nqnvlnetiq efnqqiaeav avhdeeiaas ggvgsvefvd vyhaWghei 
301 gsdepwvngvqirdlatgvtvdrstfhpna aghravgerv ieqietgpgr piyatfewa 
361 gatvdtlage vg 


FIGXJRE42 
SEQIDNo.37 

1 mrttviaasa llllagcadg areetagapp gessggiree gaeastsitd vyialgdsya 
61 amggrdqpir gepfdrssg nypellhaev tdltcqgavt gdlleprtig ertlpaqvda 
121 Itedttlvtl siggndlgfg evagcireri agenaddcvd llgetigeql dqippqidrv 
181 heairdragd aqvwtgyip Ivsagdcpel gdvseadrrw aveltgqine tvreaaerhd 
241 atfvlpddad ehtscappqq rwadiqgqqt dayplhptsa gheamaaavr dalglepvqp 
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FIGURE 43 


SEQIDNo. 38 

1 ttctggggtg ttatggggtt gttatcggct cgtcdgggt ggatcxcgcc aggtggggta 

61 ttcacggggg acttttgtgt ccaacagccg agaatgagtg ccctgagcgg tgggaatgag 
121 gtgggcgggg ctgtgtcgcc atgagggggc ggcgggctct gtggtgcccc gcgacccccg 
181 gccxcggtga gcggtgaatg aaatccggct gtaatcagca tcccgtgccc accccgtcgg 
241 ggaggtcagc gcccggagtg tctacgcagt cggatcctct cggactcggc catgctgtcg 
301 gcagcatcgc gctcccgggt cttggcgtcc ctcggctgtt ctgcctgctg tccctggaag 
361 gcgaaatgat caccggggag tgatacaccg gtggtctcat cccggatgcc cacttcggcg 
421 ccatccggca attcgggcag ctccgggtgg aagtaggtgg catccgatgc gtcggtgacg 
481 ccatagtggg cgaagatctc atcctgctcg agggtgctca ggccactctc cggatcgata 
541 tcgggggcgt ccttgatggc gtccttgctg aaaccgaggt gcagcttgtg ggcttccaat 
601 ttcgcaccacggagcgggac gaggctggaa tgacggccga agagcccgtg gtggacctca 
661 acgaaggtgg gtagtcccgt gtcatcattg aggaacacgc cctccaccgc acccagcttg 
721 tggccggagt tgtcgtaggc gctggcatcc agaagggaaa cgatctcata tttgtcggtg 
781 tgctcagaca tgatcttcct ttgctgtcgg tgtctggtac taccacggta gggctgaatg 
841 caactgttat ttttctgtta ttttaggaat tggtccatat cccacaggct ggctgtggtc 
901 aaatcgtcat caagtaatcc ctgtcacaca aaatgggtgg tgggagccct ggtcgcggtt 
961 ccgtgggagg cgccgtgccc cgcaggatcg tcggcatcgg cggatctggc cggtaccccg 
1021 cggtgaataa aatcattctg taaccttcat cacggttggt tttaggtatc cgccxx:tttc 
1081 gtcctgaccc cgtccccggc gcgcgggagc ccgcgggttg cggtagacag gggagacgtg 
1141 gacaccatga ggacaacggt catcgcagca agcgcattac tccttctcgc cggatgcgcg 
1201 gatggggcccgggaggagac cgccggtgca ccgccgggtg agtcctccgg gggcatccgg 
1261 gaggaggggg cggaggcgtc gacaagcatc accgacgtct acatcgccct cggggattcc 
1321 tatgcggcga tgggcgggcg ggatcagccg ttacggggtg agccgttctg cctgcgctcg 
1381 tccggtaattacccggaactcctccacgca gaggtcaccg atctcacctg ccagggggcg 
1441 gtgaccgggg atctgctcga accx:aggacg ctgggggagc gcacgctgcc ggcgcaggtg 
1501 gatgcgctga cggaggacac caccctggtc accctctcca tcgggggcaa tgacctcgga 
1561 ttcggggagg tggcgggatg catccgggaa cggatcgccg gggagaacgc tgatgattgc 
1621 gtggacctgc tgggggaaac catcggggag cagctcgatc agcttccccc gcagctggac 
1681 cgcgtgcacg aggctatccg ggaccgcgcc ggggacgcgc aggttgtggt caccggttac 
1741 dgccgctcg tgtctgccgg ggaclgcccc gaactggggg atgtctccga ggcggatcgt 
1801 cgttgggcgg ttgagctgac cgggcagatc aacgagaccg tgcgcgaggc ggccgaacga 
1861 cacgatgccc tclttgtcct gcccgacgat gccgatgagc acaccagttg tgcaccccca 
1921 cagcagcgct gggcggatat ccagggccaa cagaccgatg cctatccgct gcacccgacc 
1981 tccgccggcc atgaggcgat ggccgccgcc gtccgggacg cgctgggcct ggaaccggtc 
2041 cagccgtagc gccgggcgcg cgcttgtcga cgaccaaccc atgccaggct gcagtcacat 
2101 ccgcacatag cgcgcgcggg cgatggagta cgcaccatag aggatgagcc cgatgccgac 
2161 gatgatgagc agcacactgc cgaagggttg ttccccgagg gtgcgcagag ccgagtccag 
2221 acctgcggcc tgctccggat catgggccca accggcgatg acgatcaaca cccccaggat 
2281 cccgaaggcg ataccacggg cgacataacc ggctgttccg gtgatgatga tcgcggtccc 
2341 gacctgccct gaccccgcac ccgcctccag atcctcccgg aaatcccggg tggccccctt 
2401 ccagaggttg tagacacccg cx:cccagtac caccagcccg gcgaccacaa ccagcaccac 
2461 accccagggt tgggatagga cggtggcggt gacatcggtg gcggtctccc catcggaggt 
2521 gctgccgccc cgggcgaagg tggaggtggt caccgccagg gagaagtaga ccatggccat 
2581 gaccgccccc ttggcccttt ccttgaggtc ctcgcccgcx: agcagctggc tcaattgcca 
2641 gagtccx:agg gccgccaggg cgatgacggc aacccacagg aggaactgcc cacccggagc 
2701 ctccgcgatg gtggccaggg cacctgaatt cgaggcctca tcacccgaac cgccggatcc 
2761 agtggcgatg cgcaccgcga Iccacccgat gaggatgtgc agtatgccca ggacaatgaa 
2821 accacctctg gccagggtgg tcagcgcggg gtggtcctcg gcctggtcgg cagcccgttc 
2881 gatcgtccgt ttcgcggatc tggfgtcgcc cttatccata gctcccattg aaccgccttg 
2941 aggggtgggc ggccactgtc agggcggatt gtgatctgaa ctgtgatgtt ccatcaaccc 
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SEQIDNo. 39 


ZP 00094165 

1 mgqvklfarrcapvllalag lapaatvare aplaegaryv algssfaagp gvgpnapgsp 
61 ercgrgtlny phllaealkl divdatcsga tthhvlgpwn evppqidsvn gdtrlvtiti 
121 ggndvsfvgn ifiaaacekma spdprcgkwr eiteeewqad eermrsivrq iharaplarv 
181 vwdyitvlp psgtcaamai spdrtaqsrs aakriarita rvareegasi Ikfshisrrh 
241 hpcsakpwsn gisapaddgi pvhpnrlgha eaaaalvklv kimk 


FIGURE 45 
SEQDDNo. 40 

1 tgccggaact caagcggcgt ctagccgaac tcatgcccga aagcgcgtgg cactatcccg 
61 aagaccaggt ctcggacgcc agcgagcgcc tgatggccgc cgaaatcacg cgcgaacagc 
121 tctaccgcca gctccacgac gagctgccct atgaoagtac cgtacgtccc gagaagtacc 
181 tccatcgcaa ggacggttcg atcgagatcc accagcagat cgtgattgcc cgcgagacac 
241 agcgtccgat cgtgctgggc aagggtggcg cgaagatcaa ggcgatcgga gaggccgcac 
301 gcaaggaact ttcgcaattg ctcgacacca aggtgcacct gtlcctgcat gtgaaggtcg 
361 acgagcgctg ggcxjgacgcc aaggaaatct acgaggaaat cggcctcgaa tgggtcaagt 
421 gaagctcttc gcgcgccgct gcgccccagt acttctcgcc cttgccgggc tggctcx>ggc 
481 ggctacggtc gcgcgggaag caccgctggc cgaaggcgcg cgttacgttg cgctgggaag 
541 ctccttcgcc gcaggtccgg gcgtggggcc caacgcgccc ggatcgcccg aacgctgcgg 
601 ccggggcacg ctcaactacc cgcacctgct cgccgaggcg ctcaagctcg atctcgtcga 
661 tgcgacxtgc agcggcgcga cgaccxacca cgtgctgggc ccctggaacg aggttccccc 
721 tcagatcgac agcgtgaatg gcgacacccg cctcgtcacc ctgaccatcg gcggaaacga 
781 tgtgtcgttc gtcggcaaca tcttcgccgc cgcttgcgag aagatggcgt cgcccgatcc 
841 gcgctgcggc aagtggcggg agatcaccga ggaagaglgg caggccgacg aggagcggat 
901 gcgctccatc gtacgccaga tccacgcccg cgcgcctctc gcccgggtgg tggtggtcga 
961 ttacatcacg gtcctgccgc catcaggcac ttgcgctgcc atggcgattt cgccggaccg 
1021 gctggcccag agccgcagcg ccgcgaaacg gcttgcccgg attaccgcac gggtcgcgcg 
1081 agaagagggt gcatcgctgc tcaagttctc gcatatctcg cgccggcacc atccatgctc 
1 141 tgccaagccc tggagcaacg gcctttccgc cccggccgac gacggcatcc cggtccatcc 
1201 gaaccggctc ggacatgctg aagcggcagc ggcgctggtc aagcttgtga aattgatgaa 
1261 gtagctactg cactgatttc aaatagtatt gcctgtcagc tttccagccc ggattgttgc 
1321 agcgcaacag aaacttgtcc gtaatggatl gatggtttat gtcgctcgca aattgccgtc 
1381 gaagggaacg ggcgcgtcgc tcgttaacgt cctgggtgca gcagtgacgg agcgcgtgga 
1441 tgagtgatac tggcggtgtc atcggtgtac gcgccgccat tcccatgcct gtacgcgccg 
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FIGURE 46 

SEQIDNa41 
NP 625998 . 

1 mrrfrlvgfl sslvlaagaa Itgaataqaa qpaaadgyva Igdsyssgvg agsyisssgd 
61 ckrstkahpy hwaaahspstfdftacsgartgdvlsgqig plssgtgfvs isiggndagf 
121 adtmttcvlq sessclsria taeayvdsti pgkldgvysa isdkapnahv wigyprfyk 
181 igttdglse tkrlalnkas dhlntvlaqr aaahgftfgd vrttftghel csgspwlhsv 
241 nwlnigesyh ptaagqsggy Ipvlngaa 

// 

FIGURE 47 
SEQIDNo. 42 

1 cccggcggcc cglgcaggag cagcagccgg cccgcgatgt cctcgggcgt cgtcttcatc 
61 aggccgtcca tcgcgtcggc gacjcggcgcc gtgtagttgg cccggacctc gtcccaggtg 
121 ccx:gcggcga tctggcgggt ggtgcggtgc gggccgcgcc gaggggagac gtaccagaag 
181 cxx:atcgtca cgttclccgg ctgcggttcg ggctcgtccg ccgctccgtc cgtcgcctcg 
241 ccgagcacct tctcggcgag gtcggcgctg gtcgccgtca ccgtgacgtc ggcgccccgg 
301 ctccagcgcg agatcagcag cgtccagccg tcgccctccg ccagcgtcgc gctgcggtcg 
361 tcgtcgcggg cgatccgcag cacgcgcgcg ccgggcggca gcagcgtggc gccggaccgt 
421 acgcggtcga tgttcgccgc gtgcgagtac ggctgctcac ccgtggcgaa acggccgagg 
481 aacagcgcgt cgacgacgtc ggacggggag tcgctgtcgt ccacgttgag ccggatcggc 
541 agggcttcgt gcgggttcac ggacatgtcg ccatgatcgg gcacccggcc gccgcgtgca 
601 cccgctttcc cgggcacgca cgacaggggc tttctcgccg tcttccgtcc gaacttgaac 
661 gagtgtcagc catttdtgg catggacact tccagtcaac gcgcgtagct gctaccacgg 
721 ttgtggcagc aatcctgcta agggaggttc catgagacgt ttccgacttg tcggcttcct 
781 gagttcgcte gtcctcgccg ccggcgccgc cctcaccggg gcagcgaccg cccaggcggc 
841 ccaacccgcc gccgcx:gacg gctatgtggc cctcggcgac tcctactcct ccggggtcgg 
901 agcgggcagc tacatcagct cgagcggcga ctgcaagcgc agcacgaagg cccatcccta 
961 cctgtgggcg gccgcccact cgccctccac gttcgacttc accgcctgtt ccggcgcccg 
1021 tacgggtgat gttctctccg gacagctcgg cccgctcagc tccggcaccg gcctcgtctc 
1081 gatcagcatc ggcggcaacg acgccggttt cgccgacacc atgacgacct gtgtgctcca 
1141 gtccgagagctcctgcctgtcgcggatcgccaccgccgag gcgtacgtcg actcgacgct 
1201 gcccggcaag ctcgacggcg tctactcggc aatcagcgac aaggcgccga acgcccacgt 
1261 cgtcgtcatc ggctacccgc gcttctacaa gctcggcacc acdgcatcg gcctgtccga 
1321 gaccaagcgg acggcgatca acaaggcctc cgaccacctc aacaccgtcx; tcgcccagcg 
1381 cgccgccgcc cacggcttca ccttcggcga cgtacgcacc accttcaccg gccacgagct 
1441 gtgctccggc agcccctggc tgcacagcgt caactggctg aacatcggcg agtcgtacca 
1501 cccxaccgcg gccggccagt ccggtggcta cctgccggtc ctcaacggcg ccgcctgacc 
1561 tcaggcggaa ggagaagaag aaggagcgga gggagacgag gagtgggagg ccccgcccga 
1621 cggggtcccc gtccccgtct ccgtctccgt cccggtcccg caagtcaccg agaacgccac 
1681 cgcgtcggac gtggcccgca ccggactccg cacctccacg cgcacggcac tctcgaacgc 
1741 gccggtgtcg tcgtgcgtcg tcaccaccac gccgtcctgg cgcgagcgct cgccgcccga 
1801 cgggaaggacagcgtccgccacccx:ggatcggagaccgacccgtccgcgg tcacccaccg 
1861 gtagccgacc Iccgcgggca gcx:gcccgac cgtgaacgtc gcxgtgaacjg cgggtgcccg 
1921 gtcgtgcggc ggcggacagg cccccgagta gtgggtgcgc gagcccacca cggtcacctc 
1981 caccgactgc gctgcggggc 
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FIGURE 48 
SEQIDNo. 43 
NP 827753 . 

1 mrrsritayv tslllavgca Itgaataqas paaaatgyva Igdsyssgvg agsylsssgd 
61 ckrsskaypy Iwqaahspss fsfmacsgar tgdvlanqlg tinsstgivs Itiggndagf 
121 sdvmttcvlq sdsaclsrin takayvdsti pgqidsvyta istkapsahv avlgyprfyk 
181 Iggsclagis etkrsainda adylnsaiak raadhgftfg dvkstftghe icssstwihs 
241 Idllnigqsy hptaagqsgg yipvmnsva 

// 

FIGURE 49 
SEQIDNo. 44 

1 ccaccgccgg gtcggcggcg agtctcctgg cctcggtcgc ggagaggttg gccgtgtagc 
61 cgttcagcgc ggcgccgaac gtcttcttca ccgtgccgcc gtactcgttg atcaggccct 
121 tgcccttgct cgacgcggcc ttgaagccgg tgcccttctt gagcgtgacg atgtagctgc 
1 81 ccttgatcgc ggtgggggag ccggcggcga gcaccgtgcc ctcggccggg gtggcctggg 
241 cgggcagtgc ggtgaatccg cccacgaggg cgccggtcgc cacggcggtt atcgcggcga 
301 tccggatctt cttgctacgc agctgtgcca tacgagggag tcclcctctg ggcagcggcg 
361 cgcctgggtg gggcgcacgg ctgtgggggg tgcgcgcgtc atcacgcaca cggccctgga 
421 gcgtcgtgtt ccgccctggg ttgagtaaag cctcggccat ctacgggggt ggctcaaggg 
481 agttgagacc ctgtcatgag tctgacatga gcacgcaatc aacggggccg tgagcacccc 
541 ggggcgaccc cggaaagtgc cgagaagtct tggcatggac acttcctgtc aacacgcgta 
601 gctggtacga cggttacggc agagatcctg ctaaagggag gttccatgag acgttcccga 
661 attacggcat acgtgacctc actcctcctc gccgtcggct gcgccctcac cggggcagcg 
721 acggcgcagg cgtccccagc cgccgcggcc acgggctatg tggccctcgg cgactcgtac 
781 tcgtccggtg tcggcgccgg cagctacctc agctccagcg gcgactgcaa gcgcagttcg 
841 aaggcctatc cgtacctctg gcaggccgcg cattcaccct cgtcgttcag tttcatggct 
901 tgctcgggcg ctcgtacggg tgatgtcctg gccaatcagc tcggcaccct gaactcgtcc 
961 accggcctgg tctccctcac catcggaggc aacgacgcgg gctlctccga cgtcatgacg 
1021 acctgtgtgc tccagtccga cagcgcctgc ctctcccgca tcaacacggc gaaggcgtac 
1081 gtcgactcca ccctgcccgg ccaactcgac agcgtgtaca cggcgatcag cacgaaggcc 
1 141 ccgtcggccc atgtggccgt gctgggctac ccccgcttct acaaactggg cggctcctgc 
1201 ctcgcgggcc tctcggagac caagcggtcc gccatcaacg acgcggccga ctatctgaac 
1261 agcgccatcg ccaagcgcgc cgccgaccac ggcttcacct tcggcgacgt caagagcacc 
1321 ttcacx;ggcc atgagatctg ctccagcagc acctggctgc acagtctcga cctgctgaac 
1381 atcggccagt cctaccaccc gaccgcggcc ggccagtccg gcggctatct gccggtcatg 
1441 aacagcgtgg cctgagctcc cacggcctga atttttaagg cctgaatttt taaggcgaag 
1501 gtgaaccgga agcggaggcc ccgtccgtcg gggtctccgt cgcacaggtc accgagaacg 
1561 gcacggagtt ggacgtcgtg cgcaccgggt cgcgcacctc gacggcgatc tcgttcgaga 
1621 tcgttccgct cgtgtcgtac gtggtgacga acacctgctt ctgctgggtc tttccgccgc 
1681 tcgccgggaa ggacagcgtc ttccagcccg gatecgggac ctcgcccttc ttggtcaccc 
1741 agcggtactc cacctcgacc ggcacccggc ccaccgtgaa ggtcgccgtg aacgtgggcg 
1801 cctgggcggt gggcggcggg caggcaccgg agtagtcggt gtgcacgccg gtgaccgtca 
1861 ccttcacgga ctgggccggc ggggtcgtcg tacx^gccgcc gcx:accgccg cctcccggag 
1921 tggagcccga gctgtggtcg cccccgccgt cggcgttgtc gtcctegggg gttttcgaac 

// 
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SEQIDNo. 45 

MRLTRSLSAASVIVFALLLALLGISPAQAAGPAYVALGDSYSSGNGAGSYIDSSGDCHRSN 

NAYPARWAAANAPSSFrFAACSGA\^TDVINNQL6ALNASTGLVSmGGNDAGFADA^^ 

CVrSSDSTajSlRLATAThr^lNrrrUJU^LDAW 

LGLS^^XRAAI^rrTADTLNS^/^SSRATAHGFRFGDVRPTFNNHEIJ^FGNDWmSLT^ 
SYHPTSTGHQSGYLPVLNANSST 

FIGURE 51 
SEQIDNo. 46 

ACAGGCCGATGCACGGAACCGTACCTTTCCGCAGTGAAGCGCTCTCCCCCCATCGTTCGC 

CGGGACTTCATCCGCGATTTTGGCATGAACACTTCCTTCAACGCGCGTAGCTTGCTACAA 

GTGCGGCAGCAGACCCGCTCGTTGGAGGCTCAGTGAGATTGACCCGATCCGTGTCGGCCG 

CATCGGTCATCGTCTTCGCCCTGCTGCTCGCGCTGCTGGGCATCAGCCGGGGGCAGGCAG 

CCGGCCCGGCCTATGTGGCCCTGGGGGATTCGTATTCCTCGGGCAACGGCGCCGGAAGTr 

ACATCGATTCGAGCGGTGAGTGTGACCGCAGCAACAACGCGTACCGGGCCCGCTGGGCGG 

CGGCCAACGGACCGTCCTCGTTCAGGTTCGGGGCCTGCTCGGGAGGGGTGACCACGGATG 

TGATCAACAATGAGCTGGGGGCCCTCAACGCGTCCACCGGCCTGGTGAGCATCACCATCG 

GCGGCAATGACGCGGGCTTCGCGGACGCGATGACCACCTGCGTCAGCAGCTCGGACAGGA 

CCTGGCTCAAGCGGCTGGCCACCGCCAGGAACTAGATCAACACCACCCTGCTCGGGCGGC 

TCGAGGGGGTGTAGAGCCAGATCAAGGGGGGTGCCCCCAACGGCCGCGTGGTCGTGCTCG 

GCTAGCGGGGGATGTACGTGGCCTCGAAGGCCTGGTAGTGGCTGGGCCTGAGCAACAGGA 

AGCGGGCGGCGATGAACACCACCGCCGACACCCTCAACTCGGTGATCTCCTGCCGGGCCA 

ccgcccacggattccgattcggcgatgtccgcccgaccttcaacaAccacgaactgttgt 

tcggcaacgactggctgcactcagtcaccgtgccggtgtgggagtcgtaccaccccacca 

gcacgggccatcagagcggctatctgccggtcctcaacgccaacagctcgacctgatcaa 

cgcacggccgtgcccgccccgcgcgtcacgctcggcgcgggcggggcagcgcgtrgatca 

gcccacagtgccggtgacggtcccaccgtcacggtcgagggtgtacgtcacggtggcgcc 

gctccagaagtggaacgtcagcaggaccgtggagccgtccctgacgtcgtcgaagaactc 

cggggtcagcgtgatcacccctcccccgtagccgggggcgaagggggcgccgaactcctt 

gtaggacgtccagtcgtgcggcccggcgttgccaccgtccgcgtagaccgcttccatggt 

cgcgagccggtccccgcggaactcggtggggatgtccgtgcccaaggtggtcccggtggt 

gtccgagaggaccggggggtcgtaccggatgatgtgcagatccaaagaatt 
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